LARGE-SCALE  STRUCTURE  OF  THE  UNIVERSE: 
SIMULATIONS  AND  STATISTICS 


By 


MICHAEL  A.  JONES 


A DISSERTATION  PRESENTED  TO  THE  GRADUATE  SCHOOL 
OF  THE  UNIVERSITY  OF  FLORIDA  IN  PARTIAL  FULFILLMENT 
OF  THE  REQUIREMENTS  FOR  THE  DEGREE  OF 
DOCTOR  OF  PHILOSOPHY 

UNIVERSITY  OF  FLORIDA 


1997 


George  Clint 


cither  Oggie;  ^ 


Bill  Laswell; 


ACKNOWLEDGMENTS 


Most  importantly,  I wish  to  thank  my  parents.  This  is  really  dedicated  to  them, 
as  they  have  always  been  so  very  supportive  of  me. 

Despite  my  ascension  to  the  highest  Cabinet  post,  Slackmaster  General,  my 
friends  and  relatives  have  always  been  polite  and  encouraging.  My  advisor  Jim  Fry 
deserves  special  thanks  for  being  patient  and  friendly.  It  is  an  honor  to  have  such  an 
excellent physicist for an advisor. Professor Neil Sullivan (head of the department),
the professors on the graduate committee, and the entire staff at the Physics Department
have been very helpful in allowing me to complete my degree. Friends from
the Physics Department, Erika Kisvarsanyi, Dave Willmes, and Steve Thomas, have
provided assistance (steaming the mad cow). Particular thanks are due to Chandra
Chegireddy, who is to the Physics Department what Scottie is to the Enterprise.

My good friends Oggie Jones and Bill Hoston share the credit (blame) for influencing
me to do physics. Without their good and fun company, I would have
bowed out of technical endeavors long ago. As I completed my dissertation, several
friends  taught  me  valuable  skills:  Deborah  and  Kerry  Duggan — how  to  ride  bikes, 
Karen  and  Bobby  Livingston — how  to  lift  weights,  Professor  Vincent  Rodgers— how 
to  place  theoretical  physics,  computers,  bass,  and  drums  into  a proper  perspective. 

This research has made use of NASA's Astrophysics Data System Abstract Service.
This document was typeset using plain TeX. Computer simulations were performed
on DEC Alpha workstations.


TABLE  OF  CONTENTS 


ACKNOWLEDGMENTS 

ABSTRACT 

CHAPTERS 


REFERENCES 


BIOGRAPHICAL  SKETCH 


Abstract  of  Dissertation  Presented  to  the  Graduate  School 
of  the  University  of  Florida  in  Partial  Fulfillment  of  the 
Requirements  for  the  Degree  of  Doctor  of  Philosophy 

LARGE-SCALE  STRUCTURE  OF  THE  UNIVERSE: 
SIMULATIONS  AND  STATISTICS 


Michael  A.  Jones 
May,  1997 

Chairman:  J.  N.  Fry 
Major  Department:  Physics 

The  gravitational  formation  of  large-scale  structure  in  an  expanding  universe  is 
studied  within  the  framework  of  the  standard  hot  big  bang  model.  We  work  in  the 
scenario whereby small fluctuations in the initial density field are amplified via gravitational
instability, generating frothy, sheet-like structures which span scales from a
few million to a few hundred million light-years. Perturbation theory is utilized, based
on the equations of a collisionless gravitating fluid. Using an efficient approximation
which reproduces essential aspects of the complete model, we perform computer simulations
of large-scale matter and galaxy distributions. Random realizations of the
initial conditions are produced with varying initial statistical properties, and the final,
evolved matter distribution is investigated. The power spectrum and bispectrum
(the  two-point  and  three-point  correlation  functions  in  Fourier  space)  are  measured  to 
statistically  characterize  the  spatial  fluctuations.  Perturbation  theory  is  tested,  and 
the  transition  from  the  linear  to  nonlinear  evolutionary  regimes  is  investigated.  It  is 
found  that  a perturbative  approach  can  be  selectively  used  beyond  its  expected  range 
of  applicability.  In  some  instances,  highly  structured,  yet  dilute,  galaxy  distributions 


id  utilizing  results  from  perturbation  theory. 


CHAPTER  1 

VISUALIZING  LARGE-SCALE  STRUCTURE: 

AN  ELEMENTARY  INTRODUCTION 

1.1  Introduction 

Cosmology is the branch of astrophysics concerned with the observable universe
in its entirety. The following chapters present computational and theoretical work
(as opposed to observational or experimental work) related to the large-scale structure
problem of cosmology. The purpose of Chapter 1 is to introduce the concept of
large-scale structure in the universe. We wish in Chapter 1 to informally describe the
matter distribution on extragalactic scales and the big bang model which provides
the backdrop for the clustering of matter. Chapter 2 provides some technical background
for our particular area of research: gravitational instability in an expanding
universe, studied within a Newtonian approximation. While Chapter 1 offers a visual
or intuitive picture of cosmic structure, Chapter 2 introduces the correlation statistics
and expanding fluid perturbation theory which we use to technically describe the
structure and its dynamics. Further details of the perturbative expanding fluid approach
are described in Chapter 3, including the simplifying innovation of Zel'dovich
(1970). Results from computer simulations are presented in Chapters 4 and 5. Chapter 4
shows examples of computer simulated structure under various conditions, while
Chapter 5 provides statistical analyses of simulated cosmic structure. We close with
final remarks in Chapter 6.


Beyond the Night Sky


At  night,  we  may  look  out  and  perceive  the  vastness  of  interstellar  space.  On 
a clear  night,  we  might  see  several  thousand  stars  which  reside  in  our  neighborhood 
of the Milky Way, our home galaxy. From an Earthly point of view, these stars are
separated  by  truly  enormous  distances.  Yet  from  a cosmological  standpoint,  these 

on  the  largest  observable  scales,  one  must  penetrate  far  beyond  the  night  sky. 

It  is  fair  to  say  that  to  us  the  Sun  is  very  powerful  and  important.  Nonetheless, 
we  must  accept  that  it  is  simply  one  star  among  the  over  one  hundred  billion  stars 
which  inhabit  the  Milky  Way.  The  galaxy  is  an  interesting  and  complex  system,  but 
we  only  have  to  comprehend  it  in  a broad  sense  before  we  try  to  view  an  even  larger, 
more  inclusive,  cosmic  picture.  Our  galaxy  is  just  one  of  billions  of  galaxies  which 
we can observe from Earth. Each galaxy can be thought of as an "island universe"
containing  billions  of  stars.  Within  the  fields  of  astronomy  and  astrophysics,  the 
study  of  large-scale  structure  is  concerned  with  the  distribution  and  motion  of  matter 
on  cosmological  scales.  We  are  at  times  interested  in  galaxy-sized  regions.  More 
often  than  not  we  seek  out  the  far  reaches  of  intergalactic  space,  regions  containing 
hundreds,  thousands,  or  even  millions  of  galaxies.  Astronomical  data  show  that 
galaxies  are  not  simply  sitting  out  there  in  an  arbitrary  pattern.  Essentially  all 
other  galaxies  appear  to  be  receding  from  us,  a fact  which  is  a natural  ingredient  in 
expanding  universe  (or  "big  bang")  cosmologies.  Furthermore,  the  galaxy  positions 

galaxies  formed  in  the  first  place?  These  questions  lie  at  the  heart  of  the  large-scale 
structure  problem. 

There  is  a hierarchy  of  structure  in  the  universe.  Matter  clumps  together  in  stars, 
which in turn reside in galaxies. Then there are galaxy clusters which range from small
groups to rich clusters which may contain a thousand galaxies. Even galaxy clusters
are found to be grouped together into superclusters which may contain thousands
of galaxies. Attention must also be paid to the possibility of dark matter. In the
simplest sense, dark matter refers to nonluminous matter, or matter that we cannot
see with a telescope.1 In the same sense that the planet Pluto was discovered before
it was seen, there is matter out there on a cosmological scale which we detect (by
its gravitational influence) but cannot see. Any clumping of matter, luminous or
dark, on extra-galactic scales is customarily referred to as "structure" or "large-scale structure."

What sort of physical theories will our discussion of cosmological structure formation
rely on? In order to imagine a simple scenario which describes the formation
of large-scale structure, and which bears some resemblance to observable reality, we
must proceed within the framework of an expanding universe model. The standard
big bang model of cosmology is more of a framework or paradigm than a totally specific
theory. Nonetheless, it is a broad theory which (a) makes predictions, (b) can be
tested  against  observations,  (c)  can  be  bent  due  to  its  flexibility,  and,  importantly, 
(d)  can  conceivably  be  disproven.  But  right  now  it  is  really  the  only  player  in  the 
game,  and  so  we  have  to  go  with  it,  and  we  do  so  without  regret,  for  its  successes  are 
quite  remarkable.  We  can  think  of  the  standard  big  bang  model  as  taking  known  and 
trusted  theories  such  as  thermodynamics,  electromagnetism,  and  particle  physics  and 
placing  them  in  the  context  of  an  expanding  universe.  We  have  not  mentioned  gravity 
yet;  we  have  saved  the  most  important  for  last.  In  terms  of  interparticle  interactions 
(involving,  say,  protons,  neutrons,  and  electrons)  gravity  is  the  weakest  known  force, 
but  it  is  a long-range  force.  The  "strong  force”  (which  is  responsible  for  the  binding 
To “see” an object in this context means to detect its radiation in any region of the


they  would  see  a well  defined  spiral  galaxy.  Of  course,  if  this  hypothetical  observer 
happened  to  photograph  our  galaxy  from  a side  view,  the  results  would  be  different. 
Two  such  side  views  are  shown  in  Figure  1-2. 

All six galaxy images shown in Figures 1-1 and 1-2 were obtained by the Automatic
Plate Measuring Survey, a modern astronomical survey containing over two
million galaxies. Even our small sampling of six pictures shows a bit of the diversity
in galaxy types. Most of the galaxies on record fall into two broad categories,
spiral and elliptical. For example, in Figure 1-1 the galaxy NGC 3962 is elliptical,
and NGC 2811 is lenticular, a subclass of spiral. First, we note that such classifications
are not merely idle categorization. The galaxy classification schemes reveal
complex (and not yet fully understood) correlations between galaxy type and the
spectroscopy, chemical composition, age, and environment of the galaxies. Second,
for our purposes the important point may be this: It is remarkable that such a huge
population of galaxies can be usefully dissected. Keep in mind that each galaxy represents
a complex system containing many stars and, to varying degrees, dark matter
and interstellar dust.

Though our galaxy is just one among many, we need to discuss the local environment
a little more specifically before we move on. In particular, to appreciate the
magnitude of the extragalactic distance scale, we need to review the situation closer
to home.

1.3.2 Our Immediate Neighborhood

[Figure panels: NGC 2811, NGC 3200]
Figure 1-2. A side view of two galaxies from the Automatic Plate Measuring survey.
Credit: Naim et al. (1995).

To get a feel for astronomical distances, we will start small and then advance to
the cosmological scale. The radius of our Earth is about 6,400 kilometers. As was
known by the ancients using basic measurements and trigonometry, the Earth-Moon
separation is about 60 Earth radii, or about 400,000 kilometers. The mean Earth-Sun
separation is about 150 million kilometers. Even these local distances can be hard to
comprehend  at  a glance.  The  speed  of  light  is  a familiar,  universal  standard  which 
will  make  our  distance  determinations  a little  more  intuitive.  Since  the  speed  of  light 
is  300  thousand  kilometers  per  second,  it  takes  light  about  1.3  seconds  to  travel  the 
400,000  kilometers  between  the  Earth  and  the  Moon.  So  we  refer  to  that  distance  as 
1.3  light-seconds.  Likewise,  radiation  from  the  Sun  takes  about  8.3  minutes  to  reach 
us here, so the Earth-Sun distance is 8.3 light-minutes. The semimajor axis of Pluto's
elliptical orbit is about 5.5 light-hours, which defines a rough size to the solar system.
Since  the  speed  of  light  is  large  compared  to  current  rocket  speeds,  it  is  easy  to  see 

One light-year, which is approximately equal to 10^13 kilometers, proves to be
a convenient astronomical unit. Though ten trillion kilometers seems quite large
in Earthly terms, it is still easily imaginable using the familiar concept of parallax.
There are hundreds of nearby stars whose distances can be measured using the parallax
method (Mihalas & Binney 1991, Unsöld & Baschek 1991). Based on positional
observations spanning six months, we can use the diameter of the Earth's orbit about
the Sun as a baseline for a simple trigonometric calculation. Using this technique,
we can find the distance to our nearest neighbors, α Centauri and its dim companion
Proxima Centauri, to be 4.3 light-years. There are about 50 such distance determinations
for stars within about 15 light-years of us, and this technique can be used with
some success for a thousand or so stars within about 100 light-years. Incidentally, of
these stars, some can be seen with the naked eye, and some are too dim to see despite
their proximity. In astronomy, we divide a circle into 360 degrees, and 1 degree is
further divided into 60 arcminutes, and each arcminute is divided into 60 arcseconds.
Proxima Centauri shifts in angular position by 1.52 arcseconds, relative to the
fixed celestial sphere of distant stars and galaxies, over 6 months of observation. If
we take into account that the Earth has shifted by about 16.6 light-minutes during
this period, then we deduce the 4.3 light-year distance. In addition to the light-year,
another frequently encountered astronomical length unit is the parsec, which is about
3.26 light-years (an object one parsec distant would shift in apparent position by an
arcsecond over three months). With the new orbiting telescopes and adaptive optics,
which avoid the smearing effect of the Earth's atmosphere, an angular resolution down
to 0.001 arcseconds allows one to measure parallax out to a few thousand light-years,
still well within the bounds of the Milky Way.
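
The parallax arithmetic quoted above can be checked with a few lines of Python. This is only a minimal sketch: it uses the small-angle approximation (distance equals baseline divided by the angular shift in radians) together with the rounded figures from the text. The function name and the 365.25-day year are illustrative assumptions, not part of the original analysis.

```python
import math

# One arcsecond expressed in radians (1 degree = 3600 arcseconds).
ARCSEC_IN_RADIANS = math.pi / (180.0 * 3600.0)
# Assumed conversion: a 365.25-day year expressed in minutes.
MINUTES_PER_YEAR = 365.25 * 24.0 * 60.0


def parallax_distance_light_years(shift_arcsec, baseline_light_minutes):
    """Small-angle estimate: distance = baseline / angular shift (in radians)."""
    shift_rad = shift_arcsec * ARCSEC_IN_RADIANS
    distance_light_minutes = baseline_light_minutes / shift_rad
    return distance_light_minutes / MINUTES_PER_YEAR


# Proxima Centauri: 1.52 arcseconds over a 16.6 light-minute baseline (six months).
proxima = parallax_distance_light_years(1.52, 16.6)
print(f"Proxima Centauri: {proxima:.2f} light-years")

# One parsec: 1 arcsecond over an 8.3 light-minute baseline (three months).
parsec = parallax_distance_light_years(1.0, 8.3)
print(f"One parsec: {parsec:.2f} light-years")
```

With these rounded inputs the sketch gives roughly 4.3 light-years for Proxima Centauri and a parsec of a little over 3.2 light-years. That is consistent with the quoted 3.26, given that the baseline is rounded to 8.3 light-minutes.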

There are other methods available to find distances. Parallax methods can be
extended by comparing measurements taken over a period of twenty years or more,
accounting for the solar system's orbital motion of 220 kilometers per second about the
galactic center (taking roughly 200 million years per orbit). As impressive as these
important parallax methods are, they still only probe our neighborhood within the
Milky Way. Other distance methods are generally variations on the following theme.
If we have two standard candles (astronomical objects which we believe to have equal
intrinsic luminosity), and one is twice as distant as the other, and if there is no
obscuring dust in the way, then the more distant standard candle will appear exactly
one fourth as bright. This is an oversimplification, and such methods are generally
quite complicated and imprecise in practice. One type of star in particular, known
as a Cepheid, has been quite reliable as a standard candle. There are other standard
candles of varying reliability, including certain types of stars, dust clouds, supernovae,
and galaxies. Over the years, such standard candle methods have become fairly robust
within our galaxy and for neighboring galaxies. Concerning the extragalactic regions,
there are still discrepancies of a factor of two in these complex techniques.
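
The standard candle argument rests on the inverse-square law for apparent brightness. The following Python sketch illustrates only that idealized relation, assuming two sources of equal intrinsic luminosity and no intervening dust. The function names are hypothetical, and none of the practical complications mentioned above are modeled.

```python
import math


def relative_brightness(distance_ratio):
    """Apparent brightness of the farther candle relative to the nearer one.

    distance_ratio is (far distance) / (near distance); the inverse-square law
    gives a brightness ratio of 1 / distance_ratio**2.
    """
    return 1.0 / distance_ratio ** 2


def distance_ratio_from_brightness(brightness_ratio):
    """Invert the inverse-square law: how many times farther is the dimmer candle?"""
    return math.sqrt(1.0 / brightness_ratio)


# A candle twice as distant appears one fourth as bright.
print(relative_brightness(2.0))

# Conversely, a candle that appears 1/100 as bright is ten times farther away.
print(distance_ratio_from_brightness(0.01))
```

In practice one measures the brightness ratio and solves for the distance ratio, which is why an error in the assumed intrinsic luminosity feeds directly into the distance estimate.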

Essentially all stars and galaxies which we observe are fixed on the celestial sphere,
which means that we do not see them move at all, even though we repeatedly observe
them over months and years. When one speaks of motions within our galaxy, and
of the cosmic motions associated with large-scale structure, it is important to understand
that such motions are impossible to see by trying to watch things move through
a telescope. Effects giving rise to apparent motion, such as the Earth's rotation and
the precession of the equinoxes, are accounted for when coordinates for a star or
galaxy are specified. On Earth we specify a location of a city with two angles, the
latitude (measured from the equator) and the longitude (measured from the Greenwich
meridian). To specify the location of a star or galaxy, we imagine that there
is an infinitely large celestial sphere about the Earth that has its own latitude and
longitude lines of reference drawn on it. The celestial latitude is called the declination
and is measured from the projection of the Earth's equator onto the celestial sphere.
The celestial longitude is called the right ascension and is measured with reference to
the Sun's celestial meridian at the vernal equinox, March 21, of some reference year,
typically 1950 or 2000. So if you (1) know where your telescope is located, (2) can
describe where your telescope is pointing, and (3) know what time it is, then you
can unambiguously specify the astronomical coordinates of the star or galaxy in the
center of the telescope's field of view.

1.3.3 The Milky Way and the Nearby Universe

The  luminous  disk  of  our  galaxy  is  about  100  thousand  light-years  in  diameter. 
Throughout much of the disk, its thickness is only about 2,000 light-years, although
this increases at the central bulge. It is not a trivial task to describe our galaxy,
mostly because it is quite complex, and the observational data are generally not
simple to interpret. But we can still supply a broad view which will help orient us
toward our later cosmological perspective (Mihalas & Binney 1991, Peebles 1993).
As  one  might  glean  from  Figures  1-1  and  1-2,  the  two  main  features  of  the  Milky 
Way  are  the  disk  and  the  spheroidal  component.  The  disk  consists  of  two  main 
populations.  The  main  stellar  population  is  smoothly  distributed  but  becomes  more 
concentrated  toward  the  galactic  center.  These  stars,  like  the  Sun,  are  fairly  aged 
(but  are  definitely  not  the  oldest  stars  around)  and  lie  close  to  the  central  galactic 
plane  and  travel  in  nearly  circular  orbits  about  the  galactic  center.  The  other  disk 
population,  associated  with  the  spiral  arms,  contains  a clumpy  distribution  of  young 
stars,  dust,  gas,  and  nebulae.  The  chemical  composition  of  the  disk  suggests  that 

stars.  The  spheroidal  component  can  be  broken  into  a few  sections.  The  galactic 
nucleus  is  the  dense,  violently  active  region  within  10  light-years  of  the  center.  The 
galactic  bulge  is  the  central  region  about  20  thousand  light-years  in  diameter.  The 
halo  is  the  blob  of  matter,  both  luminous  and  dark,  which  envelops  the  entire  disk, 
perhaps  even  extending  beyond.  Note  that  the  halo  is  not  actually  evident  in  the 
photographs  in  Figures  1-1  and  1-2;  only  the  increased  central  luminosity  and  bulge 
are clearly shown. The spheroidal component is mostly free of gas and dust, and its
stars are generally quite old, roughly as old as the Milky Way itself. Some of these
stars reside in globular clusters containing up to 100,000 stars. There are about 200
such globular clusters scattered about the inner galactic halo.

In Figure 1-3, we provide a rough schematic side view of the galaxy. Note that
the familiar constellations and practically the entire night sky we observe with our
naked eye are contained within a few hundred light-years of us. We try to indicate
this on the figure, but keep in mind that in actuality, this night sky region comprises
a fraction of a percent of the galactic volume. There are a few major exceptions to
these visual limitations. To the casual observer strolling about the countryside on a
clear night, the "milky way" is the faint, milky ribbon which encircles the sky. It is
brightest toward the constellation Sagittarius, which is the direction to the galactic
center. We are seeing the effect of distant stars within our own galactic disk, and so
we get an inside, panoramic view of our galaxy. Another important yet dim night sky
object is the Andromeda nebula, which appears as a fuzzy star in the constellation
Andromeda, near the square of Pegasus. Through a telescope (after a conventional
photographic exposure of perhaps a quarter of an hour), this dim entity is revealed to
be a giant spiral galaxy, similar to the Milky Way. It is the closest significant galaxy
to us, being only 2.5 million light-years distant.3 The Large and Small Magellanic
Clouds (seen only in the southern sky) are small irregular galaxies about 150 thousand
light-years from us.

In the literature, e.g. Peebles (1993), this result is quoted with a 4% uncertainty. Even
accurate distance determinations within our galaxy are uncertain by up to 10-20%.
Other extragalactic distances are generally much more uncertain.

Figure 1-3. Schematic side view of our galaxy (axis scale in thousands of light-years).

All galaxies, including our own, are dynamical systems which evolve throughout
their lifespan. With such a large sample of galaxies available for observational study,
one might think that we would have ample opportunity to witness galaxies at various
stages of evolution. Nearby galaxies are viewed as our contemporaries, as the light
from them has taken only tens of millions of years to reach us. More distant
galaxies are viewed well back in their history, as their light has been traveling hundreds
of millions of light-years toward us. Although evolutionary effects have been noted
and discussed in the literature, there are not many systematically useful results that
are known. Lack of progress here is probably related to the lack of a specific theory

Nonetheless, we still know a fair amount about galaxies in general. For example,
the disk material does not rotate together, like a rigid body. If that were the case,
the stars and gas in the outer regions would have to travel faster in their trajectories
to keep up with the inner material. What happens is that the outer matter (which
has longer to travel) gets passed up by the inner matter. In fact, trajectories depend
on the mass distribution within the galaxy, according to the equations of gravity. For
example, in our solar system, the planets have speeds which decrease in inverse proportion
to the square roots of their distances from the Sun, as explained by Newton.
Such a velocity profile (known as a "rotation curve" when applied to galactic matter)
is called Keplerian. In many galaxies (including our own) the rotation speeds are,
roughly speaking, fairly uniform throughout the disk, so we would say these galaxies
have flat rotation curves. For spiral galaxies, this is only part of the story, since the
spiral arms are dynamically evolving density enhancements which rotate at a slower
rate than the rest of the disk material.
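
The three velocity profiles discussed above (rigid-body, Keplerian, and flat) can be compared numerically. The Python sketch below is illustrative only. It derives the Earth's orbital speed from the 150 million kilometer Earth-Sun separation quoted earlier, treating the orbit as a circle traversed in an assumed 365.25-day year, and then scales that speed with distance. The function names and the sample radii (including Jupiter at about 5.2 Earth-Sun distances) are assumptions introduced for the example.

```python
import math

EARTH_SUN_KM = 150e6                      # mean Earth-Sun separation from the text
SECONDS_PER_YEAR = 365.25 * 24 * 3600     # assumed length of the year

# Circular-orbit approximation: speed = circumference / period.
EARTH_SPEED_KM_S = 2 * math.pi * EARTH_SUN_KM / SECONDS_PER_YEAR


def keplerian_speed(r, r_ref=1.0, v_ref=EARTH_SPEED_KM_S):
    """Speed falls as the inverse square root of distance from a dominant central mass."""
    return v_ref * math.sqrt(r_ref / r)


def rigid_body_speed(r, r_ref=1.0, v_ref=EARTH_SPEED_KM_S):
    """Speed grows in proportion to distance, as for a spinning solid disk."""
    return v_ref * r / r_ref


def flat_speed(r, v_flat=220.0):
    """Speed is independent of distance (220 km/s is the solar value quoted in the text)."""
    return v_flat


# Radii in units of the Earth-Sun distance; 5.2 is roughly Jupiter's orbit (assumed).
for r in (1.0, 5.2, 30.0):
    print(f"r = {r:5.1f}  Keplerian {keplerian_speed(r):6.2f} km/s  "
          f"rigid {rigid_body_speed(r):7.2f} km/s  flat {flat_speed(r):6.1f} km/s")
```

The Keplerian column reproduces the familiar slowdown of the outer planets, while the rigid-body column shows how quickly outer material would have to move if a disk really rotated as a solid.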

Looking  at  galaxies  from  a cosmological  perspective,  an  overall  picture  might  be 
something  like  this.  With  our  knowledge  of  the  stellar  populations,  gaseous  content, 
and the dynamical information, we have a clue as to the age and evolutionary properties
of galaxies. So we claim to know enough about galaxies to use them as tracers

As our final point concerning galaxies, we mention the halo and dark matter, a
matter which pertains directly to the large-scale structure problem. Returning to the
solar system trajectories, we can say that the reason for the Keplerian decrease in
planetary speeds is that the trajectories are governed by the large central solar mass.
The other masses within the solar system only perturb this primary effect. We would
not expect the orbital speeds within a galactic disk to decrease in a Keplerian fashion.
However, as we neared the edge of the luminous disk, we would expect a characteristic
decrease in the rotation speeds. With modern finely resolved measurements on dozens
of galaxies, we are able to obtain spectrographic data spanning the various regions of
individual galaxies. Such spectral data can be analyzed for Doppler shifts from which
we obtain rotation speed profiles.4 The rotation curves appear flat even beyond the
luminous disk. If the trajectories are governed by conventional gravitational physics,
then the emerging picture of a galaxy includes a large spheroidal blob (called the
halo) containing dark matter and spanning some 100-600 thousand light-years.

Shifts in the spectral pattern of galactic electromagnetic radiation are directly related
to galactic motion; this is analogous to the acoustical Doppler shift whereby a musical
pitch rises and falls according to the motion of the sound source. To find the
velocity patterns within a galaxy, the measurements must be so precise as to show
the differences in Doppler shift between various regions of the same galaxy.
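
To see why a flat rotation curve points to unseen mass, one can estimate the mass enclosed within a circular orbit. The Python sketch below assumes a spherically symmetric mass distribution and circular orbits, for which Newtonian gravity gives M(<r) = v^2 r / G. This simplification is introduced here for illustration and is not a model taken from the text. The physical constants are standard rounded values, and the sample radii are arbitrary.

```python
G = 6.674e-11             # gravitational constant, m^3 kg^-1 s^-2
SOLAR_MASS_KG = 1.989e30  # mass of the Sun in kilograms
LIGHT_YEAR_M = 9.461e15   # one light-year in meters


def enclosed_mass_solar(speed_km_s, radius_light_years):
    """Mass inside radius r for a circular orbit of speed v: M = v^2 r / G.

    Assumes a spherically symmetric mass distribution (an idealization).
    Returns the mass in units of the solar mass.
    """
    v = speed_km_s * 1.0e3
    r = radius_light_years * LIGHT_YEAR_M
    return v * v * r / G / SOLAR_MASS_KG


# A flat 220 km/s curve sampled at several illustrative radii.
for radius in (25_000, 50_000, 100_000, 300_000):
    mass = enclosed_mass_solar(220.0, radius)
    print(f"r = {radius:7d} light-years  ->  M(<r) ~ {mass:.2e} solar masses")
```

Because the speed stays constant, the inferred enclosed mass keeps growing in proportion to radius, even beyond the edge of the luminous disk where little visible matter remains.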

We  believe  that  all  matter  responds  to  the  gravitational  force  in  the  same  way. 
This  statement  would  apply  to  regular,  everyday  matter  (e.g.  protons  and  neutrons) 
of  course,  but  there  is  more  to  the  story.  The  universal  influence  of  gravitation  would 
also  include  unusual  matter  (e.g.  neutrinos,  which  have  been  known  to  the  nuclear 
and  particle  physicists  for  years)  and  exotic  matter  (e.g.  axions  and  other  currently 
undetected  particles).  We  know  there  is  dark  matter,  we  just  do  not  know  what 
it is made of. The exotic forms of dark matter are often called "nonbaryonic," as
"baryonic" is a term loosely applied to protons and neutrons (which are the heaviest
form of regular, stable matter). Galactic halos may be made from one type or from
a mix of different types of dark matter. The issue is further complicated by the fact
that dark matter is detected on larger, cosmological scales. There are arguments that
some of the dark matter must be nonbaryonic. These dark matter issues seem to
span a vast range of distances. But the galactic scales are almost certainly related to
the  larger  cosmological  scales.  Recall  that  our  galaxy,  as  well  as  all  the  other  galaxies 
and  even  larger  structures,  did  not  exist  far  into  the  past.  They  were  formed  within 
an  expanding  universe  that  was  much  smoother  (less  lumpy)  in  the  past.  The  dark 
matter  connection  is  simply  the  fact  that  all  these  structures  were  formed  as  a result 
of  gravitational  instability.  In  smaller  regions  ranging  from  stellar  to  galactic  halo 
scales,  electrodynamics  and  gas  dynamics  play  crucial  roles  which  further  complicate 


Structures  in  an  Expanding  Universe 


1.4.1 The Big Bang Model

In  the  previous  sections,  we  were  trying  to  find  a perspective  from  which  we  could 
appreciate  the  nature  of  extragalactic  data,  and  so  now  we  turn  to  the  larger  picture 
of  the  big  bang  and  cosmic  structure. 

All distant galaxies are receding from us. Each galaxy has a well defined, measurable
Doppler shift in its electromagnetic spectrum. If there were no relative motion
between us and some particular galaxy, then we would detect no Doppler shift for it.
However, essentially all galaxies have spectra which are shifted toward longer wavelengths.
In the visible region of the spectrum, red has the longest wavelength and
blue has the shortest wavelength. Technically speaking, we say that all galaxies have
positive redshifts, which means that all galaxies are traveling away from us. This
is analogous to the following Earthly experiment. If we distributed identical tuning
forks to various people and then asked them to move about, we could analyze their
motions from the various pitches we heard. If all notes were received at a lower pitch
than the standard, that would mean that all the people were traveling away. Moreover,
the faster the recession speed, the lower the pitch we would detect. Similarly, for
galaxies, we say that the amount of redshift is proportional to the recession velocity
of the galaxy.
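
The proportionality between redshift and recession velocity can be made concrete with a short Python sketch. It uses the usual definition of redshift as the fractional wavelength shift and the low-speed approximation v = c z, which holds only for redshifts much smaller than one. The function names and the observed wavelength are hypothetical values chosen for illustration. The rest wavelength is that of the hydrogen-alpha line.

```python
SPEED_OF_LIGHT_KM_S = 300_000.0  # rounded value used in the text


def redshift(observed_wavelength, rest_wavelength):
    """Fractional wavelength shift; positive means shifted toward the red."""
    return (observed_wavelength - rest_wavelength) / rest_wavelength


def recession_velocity_km_s(z):
    """Low-speed approximation v = c * z, valid only for z much less than 1."""
    return SPEED_OF_LIGHT_KM_S * z


REST_NM = 656.28      # hydrogen-alpha rest wavelength in nanometers
OBSERVED_NM = 662.84  # hypothetical measured wavelength for some galaxy

z = redshift(OBSERVED_NM, REST_NM)
velocity = recession_velocity_km_s(z)
print(f"z = {z:.4f}, recession velocity ~ {velocity:.0f} km/s")
```

A spectral line observed at a shorter wavelength than its rest value would give a negative z, the analogue of a tuning fork heard at a higher pitch because it is approaching.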

If we trace the expansion backward in time, then galaxies used to be closer together
as we imagine the universe 5 billion and even 10 billion years ago. We can
go a bit further back, but then we are not really talking about galaxies; rather we
have oddly shaped galactic blobs and, slightly before that, protogalaxies and first
generation stars. We can even go much further back than this, still within the realm
of  a consistent  scientific  framework.  In  doing  so  we  are  simply  applying  standard 


distances  within  the  yeasty  fabric  have  doubled.  This  means  that  the  speck  twice  as 
distant  traveled  away  at  twice  the  speed,  in  analogy  with  Hubble's  law.  The  blatant 
flaw  in  this  bread  baking  analogy  is  that  the  bread  was  expanding  into  something. 
Because  general  relativity  is  describing  the  most  primitive  of  entities,  spacetime  itself, 
the  entire  picture  can  be  difficult  to  imagine.  The  big  bang  happened  everywhere;  no 
one  place  is  the  origin  of  it  all.  Present  astronomical  observations  are  consistent  with 
an  infinite  universe,  which  may  further  confound  one's  imagination. 
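
The proportionality between distance and recession speed in the bread baking analogy can be checked with simple arithmetic. The following is a minimal sketch in Python rather than part of the original analysis; the speck positions, the doubling factor, and the one hour baking time are arbitrary illustrative values.

```python
# Uniform stretching of a loaf: every separation is multiplied by the same factor.
# All numbers below are illustrative assumptions, not measurements.

def apparent_speeds(positions_cm, observer_cm, scale_factor, elapsed_hr):
    """Apparent speed of each speck relative to an observer speck.

    A speck at position x moves to scale_factor * x, so the separation
    |x - observer| grows to scale_factor * |x - observer|.
    """
    speeds = []
    for x in positions_cm:
        separation_before = abs(x - observer_cm)
        separation_after = scale_factor * separation_before
        speeds.append((separation_after - separation_before) / elapsed_hr)
    return speeds


speck_positions_cm = [0.0, 1.0, 2.0, 4.0]

# Seen from the speck at 0 cm, the loaf doubles in size over one hour.
speeds_from_0 = apparent_speeds(speck_positions_cm, observer_cm=0.0,
                                scale_factor=2.0, elapsed_hr=1.0)
# Seen from the speck at 1 cm, the same rule holds: no speck is special.
speeds_from_1 = apparent_speeds(speck_positions_cm, observer_cm=1.0,
                                scale_factor=2.0, elapsed_hr=1.0)

print("speeds seen from the speck at 0 cm:", speeds_from_0)
print("speeds seen from the speck at 1 cm:", speeds_from_1)
```

In both lists the speed is proportional to the separation from the observer, which is the content of Hubble's law; choosing a different observer changes the individual numbers but not the rule.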

These ideas, as difficult as they are to visualize, can be tested. For example,
the particular way in which the galaxies in a sample become dimmer with distance
can be used as a test to determine whether the apparent recessions are due to
an  expanding  spacetime  metric  or  some  other  unusual  effect.  Only  recently  have  the 
data  become  sufficiently  precise  so  as  to  verify  the  reality  of  the  expanding  metric  in 
this  way.7  But  such  verification  came  with  little  fanfare,  for  the  cosmic  background 
radiation  discovered  in  the  1960s  had  provided  data  which  can  only  be  explained  by 
the  expanding,  cooling  spacetime  model.  Perhaps  there  is  an  easier  way  to  come  to 
terms  with  the  expansion  phenomenon,  using  our  knowledge  of  the  speed  of  light. 
Since  the  recession  velocity  is  proportional  to  distance,  some  of  the  most  distant 
galaxies  we  observe  have  recession  velocities  in  excess  of  the  speed  of  light.  But  does 
not  that  violate  the  cosmic  speed  limit,  a foundation  of  Einstein’s  theories?  Not 
at all, for even two widely separated galaxies, apparently moving relative to each
other  at  super-light  speeds,  are  simply  sitting  there,  approximately  at  rest,  in  their 
neighborhoods  of  space.  The  apparent  recession  speed  is  only  the  result  of  the  space 
between  expanding— by  the  laws  of  general  relativity,  the  wavelength  of  traveling  light 
expands  along  with  the  space,  and  this  change  of  wavelength  is  perceived  as  a Doppler 

7 Hubble's protege Alan Sandage recently published an account of this surface brightness
test and other fundamental considerations in an article entitled “Evidence That
the Expansion Is Real” (Sandage 1992).


shift. If a galaxy were moving away with an actual physical velocity faster than the
speed  of  light,  we  would  never  be  able  to  observe  the  light  from  it.  On  the  other 
hand,  if  all  galaxies  were  just  sitting  there,  there  would  be  no  structure — matter  has 
to  move  around  in  order  to  become  clustered,  and  such  motion  is  generally  a small 
but  important  addition  to  the  overall  Hubble  flow  of  galaxies. 

The  cosmic  background  radiation  is  the  earliest  direct  relic  we  can  observe  from  a 
hot universe. The radiation we are now receiving was emitted when the temperature
(as defined by the thermal bath of electromagnetic radiation) was about 3000 K,
some 10 billion years ago, just a few hundred thousand years after the expansion of
space  began.  To  get  a handle  on  this  topic,  we  have  to  trace  backward  to  within 
minutes  after  the  beginning  of  expansion.  As  we  ponder  the  state  of  the  universe  at 
various  stages  back  to  several  billion  years  into  the  past,  we  at  least  have  recourse  to 
a certain  amount  of  basic  intuition.  For  during  this  time,  galaxies  existed  in  various 
states  of  evolution,  and  some  of  the  matter  was  flowing  smoothly  into  structures.  The 
ambient  temperature  at  the  stage  of  galaxy  formation  was  a cool  100  K or  so.  But  as 
we  probe  deeper  into  the  past,  macroscopic  structures  cease  to  exist,  and  we  enter  the 
realm of atomic physics with only the two lightest elements hydrogen and helium and
some radiation in appreciable abundance, and temperatures are in the thousands of
degrees;  the  universe  is  a hot  gas.  Earlier,  on  the  order  of  a second  after  the  big  bang, 
temperatures  are  on  the  order  of  billions  of  degrees,  and  the  fundamental  processes 
of  particle  physics  rule  the  cosmic  slop  of  protons,  neutrons,  electrons  and  neutrinos 
(and  their  antiparticles),  and  photons  (the  quantum  of  electromagnetic  radiation). 

When all physical characteristics of a radiating system except the ambient temperature
have been washed out, generally because of multiple interactions or “thermalization,”
we say the system emits blackbody radiation of a certain temperature.
In the early 1900s, Planck, Einstein, and others demonstrated that light (i.e. electromagnetic
radiation) is quantized into particles which carry energy in inverse proportion
to the wavelength of the light.8 A blackbody gives off radiation at all different
wavelengths, but the wavelengths are distributed with a spectrum (called the Planck
distribution) uniquely defined by the temperature. For example the Sun’s thermalized
outer regions emit an approximate blackbody spectrum corresponding to about
6000 K with all wavelengths represented but with the peak intensity lying in the visible
region (wavelengths of about half a thousandth of a millimeter). The cosmic background
radiation is distributed exactly according to a Planck distribution corresponding to a
temperature of 2.7 K, with peak intensity lying in the microwave region (at millimeter
wavelengths). 
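
The link between blackbody temperature and the wavelength of peak intensity can be made concrete with a short calculation. The sketch below uses Wien's displacement law, a standard consequence of the Planck distribution which the text does not state explicitly; the function name and the rounded constant are our own choices for illustration.

```python
# Wien's displacement law: the wavelength of peak blackbody emission (per unit
# wavelength) is inversely proportional to the temperature.
WIEN_CONSTANT_M_K = 2.898e-3  # meter kelvin (rounded standard value)


def peak_wavelength_m(temperature_k):
    """Wavelength in meters at which a blackbody of this temperature peaks."""
    return WIEN_CONSTANT_M_K / temperature_k


sun_peak_m = peak_wavelength_m(6000.0)  # temperature quoted for the Sun's outer regions
cmb_peak_m = peak_wavelength_m(2.7)     # temperature of the cosmic background radiation

print(f"Sun (6000 K) peaks near {sun_peak_m * 1e9:.0f} nanometers")
print(f"Cosmic background (2.7 K) peaks near {cmb_peak_m * 1e3:.2f} millimeters")
```

Under this law the solar peak falls within the visible band and the cosmic background peak falls at roughly a millimeter, in the microwave region.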

So from all directions of the sky we are being bombarded with a very uniform
low  energy  background  radiation  corresponding  to  about  400  million  photons  per 
cubic  meter,  detectable  only  by  sensitive  instruments.  What  is  the  density  of  regular 
matter  in  our  part  of  the  universe?  Roughly  speaking,  just  by  counting  the  thousands 
of  galaxies  near  us  (the  mean  separation  between  galaxies  being  some  15  million  light- 
years),  we  can  estimate  the  average  baryon  (neutron  or  proton)  density  to  be  on  the 
order  of  one  per  cubic  meter.  Although  the  photons  far  outnumber  the  baryons, 
presently  the  radiation  contributes  a negligible  portion  of  the  overall  energy  in  the 
universe  (the  jargon:  presently  the  universe  is  “matter  dominated”).  Far  into  the 
past,  say  for  the  first  10  thousand  years  or  so,  when  temperatures  were  on  the  order 
of  thousands  or  millions  of  degrees,  the  energy  of  radiation  comprised  most  of  the 
universe's  energy  (the  universe  was  “radiation  dominated”).  These  photons  have 
traveled  long  and  far  to  reach  us,  and  they  began  their  journey  during  an  earlier, 
hotter  epoch. 
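
The figure of about 400 million photons per cubic meter, and the claim that radiation is presently a negligible part of the energy budget, can both be checked from the blackbody formulas. The following is a rough sketch using the standard expressions for the photon number density and energy density of blackbody radiation, which the text does not write out; the value of one baryon per cubic meter is taken from the estimate above, and each baryon is assumed to contribute only its rest energy.

```python
import math

# Physical constants in SI units
K_B = 1.380649e-23         # Boltzmann constant, J/K
HBAR = 1.054571817e-34     # reduced Planck constant, J s
C = 2.99792458e8           # speed of light, m/s
M_PROTON = 1.67262192e-27  # proton mass, kg
ZETA_3 = 1.2020569         # Riemann zeta function at 3


def photon_number_density(temperature_k):
    """Photons per cubic meter in blackbody radiation at this temperature."""
    x = K_B * temperature_k / (HBAR * C)  # inverse thermal length, 1/m
    return 2.0 * ZETA_3 / math.pi**2 * x**3


def photon_energy_density(temperature_k):
    """Radiation energy in joules per cubic meter at this temperature."""
    x = K_B * temperature_k / (HBAR * C)
    return math.pi**2 / 15.0 * K_B * temperature_k * x**3


photons_per_m3 = photon_number_density(2.7)
radiation_j_per_m3 = photon_energy_density(2.7)

baryons_per_m3 = 1.0  # order-of-magnitude estimate quoted in the text
matter_j_per_m3 = baryons_per_m3 * M_PROTON * C**2  # rest energy only

print(f"photons per cubic meter:   {photons_per_m3:.2e}")
print(f"radiation energy density:  {radiation_j_per_m3:.2e} J/m^3")
print(f"matter energy density:     {matter_j_per_m3:.2e} J/m^3")
print(f"matter / radiation:        {matter_j_per_m3 / radiation_j_per_m3:.0f}")
```

With these inputs the photons outnumber the baryons by a few hundred million to one, yet the rest energy of the baryons exceeds the radiation energy by a factor of a few thousand.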

8 So an x-ray photon carries more energy than a visible light photon, which carries more
energy than a microwave photon, etc.


By number, baryons are presently about a billion times less populous than the
cosmic  photons,  but  the  baryons  are  now  playing  the  crucial  role  in  our  universe.  From 
complex  astronomical  analyses,  we  detect  that  ordinary  matter  is  presently  made  up 
of  about  three  quarters  hydrogen  and  one  quarter  helium,  and  such  abundances  are 
one of the great predictions of the big bang model. When the temperatures were on
the order of 10^10 K, less than a second after the universal expansion began, protons
and neutrons could not even consider joining up to form stable atomic nuclei. A little
bit later, within the first three minutes or so, simple nuclear reactions were able to
transpire. But the nature of the interactions (involving only the simplest combinations
of small numbers of protons and neutrons) and the cooling and dilution of the
universe allowed only a brief opportunity for nucleosynthesis. Difficult calculations,
first performed in the 1960s, provided unambiguous predictions concerning elemental
abundances;  the  success  of  these  predictions  is,  in  a sense,  an  even  deeper  probe  back 
into  time  than  the  cosmic  background  radiation.  As  an  important  note  concerning 
galaxies  and  humanity,  we  point  out  that  although  the  present  baryon  population  is 
primarily  hydrogen  and  helium,  about  2%  of  regular  cosmic  matter  is  comprised  of 
heavier  elements  such  as  carbon  and  oxygen.  But  these  important  elements  were  not 
constructed  during  the  early  universe;  they  were  synthesized  later  in  stellar  nuclear 

From Early Microscopic Processes to the Recent Formation of Galaxies

Many  of  the  events  of  the  hot,  early  universe  have  had  great  influence  on  the 
present.  We  will  just  touch  on  a few  things,  primarily  to  emphasize  the  importance 
of  the  hydrogen  and  helium  synthesis  and  to  portray  the  atmosphere  leading  to  the 
epoch  of  structure  formation. 


Throughout  much  of  the  history  of  the  early  universe,  matter  and  radiation  were 
in  approximate  thermal  equilibrium.  For  example,  less  than  a second  after  the  big 
bang, the universe was merely a hot (over 10^10 K) gas of particles which was quite
dense (over a million times denser than ordinary water) and behaved like a sea of
radiation. These particles were thermalized because of the many interactions between
the photons, electrons/antielectrons, neutrinos/antineutrinos, and the very small but
important  population  of  neutrons  and  protons. 

To  understand  these  early  times,  we  need  to  keep  in  mind  the  equivalence  of 
mass  and  energy.  Mass  is  the  source  of  gravitation;  but  since  Einstein's  time,  our 
interpretation  of  this  has  become  more  general.  Any  lump  of  matter  or  pure  energy 
(e.g.  electromagnetic  radiation  or  even  more  exotic  types  of  energy)  participates  in 
gravitational  forces.  The  universe  is  expanding,  but  the  expansion  is  getting  ever 
slower.  In  observations,  we  cannot  see  this  directly  (we  would  have  to  wait  billions  of 
years),  but  the  data  imply  it.  The  reason  for  this  deceleration  is  similar  to  the  reason 
that  a rock  thrown  upward  from  Earth  slows  down  (note  that  the  rock  eventually 
comes  back  down  unless  it  has  reached  the  Earth's  escape  velocity).  The  400  million 
photons  per  cubic  meter  (at  2.7  K)  contribute  a much  smaller  energy  density  compared 
to  the  approximately  1 baryon  per  cubic  meter.  So  the  baryons  have  a much  larger 
effect  on  slowing  the  expansion  of  space,  according  to  the  general  theory  of  relativity. 
In  the  early  universe,  various  forms  of  hot  radiation  possessed  sufficient  energy  density 
to  be  a primary  factor  in  the  cosmic  deceleration.  During  the  radiation  dominated 
era,  photons  were  by  far  the  most  populous  species,  accompanied  by  a smaller  amount 
of  neutrino  radiation  and  a small  but  crucial  population  of  electrons  and  light  nuclei 
(hydrogen  and  helium).  This  era  lasted  about  10  thousand  years,  starting  just  a 
few  minutes  after  the  big  bang.  In  older  literature,  the  early  universe  is  sometimes 


referred  to  as  the  primeval  fireball,  and  this  is  a fitting  description  of  a very  different 


At  an  age  of  about  a tenth  of  a second,  the  universe  consisted  of  particles,  in 
equilibrium,  zipping  around  at  the  speed  of  light.  Conceptually,  this  environment  is 
fairly  simple.  Only  a small  part  of  the  huge  zoo  of  elementary  particles  is  present  at 
this stage of the universe. A temperature of more than 10^10 K means that the particles
are  interacting  at  high  energies — energies  which  are  larger  than  the  rest  energy  of 
an  electron/antielectron  pair.  This  means  that  electron/antielectron  pairs  can  be 
created  and  destroyed  at  will  during  fundamental  interactions  involving  photons. 
This  is  an  example  of  Einstein's  mass-energy  equivalence  at  work.  But  the  energies 
are  relatively  low  in  the  sense  that  other  elementary  particles  are  for  the  most  part 
not in existence, as they have higher threshold energies (i.e. they are more massive).
We know that a population of neutrinos exists at this time, as they are a stable,
practically massless, particle. Although neutrinos are weakly interacting, at these
densities  they  are  thermalized  with  the  other  particles. 
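
The statement that a temperature above 10^10 K corresponds to energies exceeding the rest energy of an electron/antielectron pair can be checked by converting temperature to a characteristic thermal energy kT. The sketch below is an order-of-magnitude illustration only: it defines the threshold temperature as the point where kT equals the pair rest energy, which is our simplifying convention, and it uses the muon as a hypothetical example of a heavier particle with a higher threshold.

```python
K_B_EV_PER_K = 8.617333e-5             # Boltzmann constant, eV per kelvin
ELECTRON_REST_ENERGY_EV = 0.51099895e6  # electron rest energy, eV
MUON_REST_ENERGY_EV = 105.6583755e6     # muon rest energy, eV (illustrative heavier particle)


def thermal_energy_ev(temperature_k):
    """Characteristic thermal energy kT in electron volts."""
    return K_B_EV_PER_K * temperature_k


def pair_threshold_temperature_k(rest_energy_ev):
    """Temperature at which kT equals the rest energy of a particle/antiparticle pair."""
    return 2.0 * rest_energy_ev / K_B_EV_PER_K


electron_threshold_k = pair_threshold_temperature_k(ELECTRON_REST_ENERGY_EV)
muon_threshold_k = pair_threshold_temperature_k(MUON_REST_ENERGY_EV)

print(f"kT at 10^10 K:            {thermal_energy_ev(1.0e10) / 1e6:.2f} MeV")
print(f"electron pair threshold:  {electron_threshold_k:.2e} K")
print(f"muon pair threshold:      {muon_threshold_k:.2e} K")
```

By this convention the electron pair threshold lies near 10^10 K, while the muon threshold is some two hundred times higher, which is why heavier species are for the most part absent at this stage.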

Last  but  not  least,  we  have  the  protons  and  neutrons.  The  neutrons  are  not  stable 
(they  decay  in  minutes  on  Earth),  but  in  this  environment,  elementary  interactions 
can continually interchange the protons and neutrons.9 A priori, we might not have
reason  to  include  the  baryons  in  the  picture,  but  the  empirical  evidence  (the  current 
photon  to  baryon  ratio  of  about  a billion  to  one,  the  predominance  of  matter  over 
antimatter, and the fact that we exist) says that we should. Neutrons are slightly more
massive than protons, so as the universe expands during the initial few minutes, it
becomes favorable for the baryon reactions to lead to an excess of protons
over neutrons. By this time, the temperature is low enough that electron/antielectron
pairs cannot be created, so their population is decimated as they decay into photons.


For  example,  we  have  the  processes  p + v ■ 


but  the  helium  nucleus  (two  protons  and  two  neutrons)  is  the  only  stable  combination 
synthesized  to  an  appreciable  amount.  So  after  a few  minutes  of  universal  expansion, 
the  baryons  are  in  the  form  of  protons  (i.e.  hydrogen  nuclei)  and  helium  nuclei.  The 
excess  of  protons  over  neutrons  leads  to  a robust  prediction  of  about  75%  hydrogen 
and  25%  helium  (there  is  a leeway  of  several  percent  in  the  calculations). 
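
The way an excess of protons over neutrons translates into roughly 75% hydrogen and 25% helium can be seen from simple counting. The sketch below assumes that essentially every surviving neutron ends up bound in a helium nucleus and that protons and neutrons have equal mass; the neutron to proton ratio of about 1 to 7 is a commonly quoted value that does not appear in the text and is used here only as an assumption.

```python
def helium_mass_fraction(neutron_to_proton_ratio):
    """Fraction of baryonic mass in helium if all neutrons are bound into helium.

    Each helium nucleus holds 2 neutrons and 2 protons, so n neutrons make
    n / 2 nuclei containing 2 * n nucleons out of n + p nucleons in total.
    """
    r = neutron_to_proton_ratio
    return 2.0 * r / (1.0 + r)


assumed_ratio = 1.0 / 7.0  # assumption: about one neutron for every seven protons
helium_fraction = helium_mass_fraction(assumed_ratio)
hydrogen_fraction = 1.0 - helium_fraction

print(f"helium mass fraction:   {helium_fraction:.2f}")
print(f"hydrogen mass fraction: {hydrogen_fraction:.2f}")
```

The result is sensitive to the assumed ratio, which is one way to understand the leeway of several percent mentioned above.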

The above examples illustrate a certain type of early universe phenomenon.
Stages of approximate equilibrium are interrupted by important deviations from equilibrium.
The reduction of the neutron population and the freezing out of the
electron/antielectron pairs cited above are fairly typical examples. Generally there is a
tension between the rates of fundamental processes and the rate of the Hubble expansion
during the early universe. The rates of particle interaction depend upon the
density of particles, the strength of interactions, and the particle energies, whereas
the spatial expansion dilutes the particle population and lowers the energies of interaction
among particles. In a qualitative sense, we can see why there was only a brief
opportunity for atomic nuclei to form: early on things were too hot and dense, later
on the environment was too dilute.

As odd as it seems, the events of the early universe were changing by the millisecond
or by the minute. Today one would have to wait billions of years to see
change  of  such  magnitude.  During  the  early  times,  gravity  was  important  only  in  the 
most  global  sense — the  energy  density  of  particles  dictated  the  expansion  rate  of  the 
universe.  It  was  the  microscopic  interactions  of  elementary  particles  which  proved 
most  influential. 

Today, as we look at the matter beyond our galaxy, we see a fair amount of
structure. But if we gloss over the structure, we see a universe which seems somewhat
homogeneous overall. In the far past (e.g. during the era of nucleosynthesis) the
universe was very uniform, however. The overall average density of the universe
has  been  continually  decreasing.  After  nucleosynthesis,  the  mean  density  (of  the 
radiation)  was  on  the  order  of  10  grams  per  cubic  centimeter,  or  ten  times  that  of 
water. Currently the overall cosmic density is about 10^-30 grams per cubic centimeter
(or, as we have been saying, about one hydrogen atom per cubic meter). As far as
we  can  tell,  there  have  always  been  fluctuations  about  the  average  density,  and  these 
fluctuations  have  grown  over  time  because  of  the  force  of  gravity.  Presently,  we  can 
perform  galaxy  counts  in  a hypothetical  cube  of  length  20  million  light-years  per  side 
and  compare  the  number  of  galaxies  contained  in  the  cube  as  we  move  it  around 
the sky. We find that there are significant fluctuations in the galaxy counts in such
an  experiment.  If  we  could  possibly  perform  this  same  experiment  some  10  billion 
years  ago,  we  would  find  that  the  galaxy  counts  would  have  smaller  variation — i.e.  the 
density  fluctuations  have  grown  over  time.  In  considering  such  density  experiments,  it 
is  only  meaningful  to  consider  cosmic  scales,  as  the  local  densities  in  our  solar  system 
or  galaxy  are  huge  compared  to  the  mean  large-scale  density.  The  intricacies  of  our 
solar  system  or  the  spiral  arms  of  a galaxy  are  considered  to  be  fine  details;  they  are 
independent  systems  which  are  not  directly  related  to  the  overall  evolution  of  cosmic 
fluctuations.  This  is  just  another  way  of  saying  that  the  large-scale  structure  problem 
deals  with  regions  larger  than  galaxies. 
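
The cube counting experiment described above can be imitated on a computer. The following is a minimal sketch, not the procedure used for any real catalogue: it scatters hypothetical galaxies at random in a box 200 million light-years on a side, counts them in cubes 20 million light-years on a side, and compares the variance of the counts with the mean. The box size, the number of galaxies, and the random seed are arbitrary assumptions.

```python
import random


def counts_in_cells(points, box_size, cells_per_side):
    """Count how many points fall in each cubic cell of a gridded box."""
    cell = box_size / cells_per_side
    counts = [0] * cells_per_side**3
    for x, y, z in points:
        # min() guards against a coordinate landing exactly on the far edge
        i = min(int(x / cell), cells_per_side - 1)
        j = min(int(y / cell), cells_per_side - 1)
        k = min(int(z / cell), cells_per_side - 1)
        counts[(i * cells_per_side + j) * cells_per_side + k] += 1
    return counts


def mean_and_variance(values):
    """Mean and population variance of a list of numbers."""
    mean = sum(values) / len(values)
    variance = sum((v - mean) ** 2 for v in values) / len(values)
    return mean, variance


rng = random.Random(1997)   # fixed seed so the run is repeatable
box_size = 200.0            # million light-years (hypothetical survey volume)
n_galaxies = 5000           # hypothetical sample size
points = [(rng.uniform(0.0, box_size),
           rng.uniform(0.0, box_size),
           rng.uniform(0.0, box_size)) for _ in range(n_galaxies)]

counts = counts_in_cells(points, box_size, cells_per_side=10)  # 20 Mly cubes
mean, variance = mean_and_variance(counts)
print(f"mean count per cube: {mean:.2f}, variance: {variance:.2f}")
```

For points scattered completely at random the variance of the counts is close to the mean. A clustered distribution of galaxies would show a variance well in excess of the mean, and that excess is one simple measure of the fluctuations discussed here.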

The executive summary would sound like this. The universe has an average density
which steadily decreases due to the expansion of space. Density fluctuations have
been  amplified  due  to  gravitational  instability,  and  the  growth  of  these  fluctuations 
has  led  to  the  structure  which  we  observe  today. 

The universe was highly uniform early on, but already at that time there were
small density fluctuations. Naively, we might think that areas of slightly higher density
would attract matter to form lumps which could further attract matter, because
of the force of gravitational attraction. And the slightly underdense regions would
lose matter which might flow toward the clustered matter. This idea is not too far
off the mark. Isaac Newton considered the point in a cursory fashion three hundred
years ago. Early in this century, James Jeans made some quantitative estimates concerning
the way in which a large-scale (astronomical) fluid might break into lumps
via  gravitational  instability.  This  work  could  be  updated  to  cosmic  proportions,  and 
in  1946  E.M.  Lifshitz  made  some  headway  within  the  context  of  the  general  theory  of 
relativity.  The  modern  era  in  cosmological  structure  formation  analysis  was  primarily 
led by Ya.B. Zel’dovich in Russia and P.J.E. Peebles in the United States, beginning
in  the  1960s.  Largely  through  their  leadership,  a picture  has  emerged  over  the  last 
thirty  years  which  may  yet  gel  into  a coherent  theory.  One  remarkable  point,  which 
has added much excitement within the scientific community, is that the theory and
the  observational  data  have  both  been  improving  steadily  over  this  period. 

Consider  the  universe  when  it  was  much  smaller  and  hotter,  say  sometime  after 
nucleosynthesis, during the radiation dominated era. Density fluctuations were already
present, but they were not yet able to grow, because of the stultifying effects of
radiation  pressure.  The  gravitational  properties  of  the  universe  would  be  determined 
by  the  gas  of  radiation.  The  radiation,  like  any  form  of  mass  or  energy,  would  feel 
the  urges  of  gravitation,  but  it  also  would  feel  the  effects  of  its  own  pressure  forces, 
much  like  a familiar  fluid.  The  effects  of  fluctuations  would  be  reduced  to  a minimal 
sloshing  around  of  the  photons  and  baryons,  a photon  gas  version  of  a sound  wave. 
Any nonbaryonic dark matter present would follow the lead of the radiation. However,
after about 10 thousand years of radiation domination, the temperature will
have  cooled  to  the  extent  that  matter  comprises  a larger  component  of  total  energy 
density.  Matter  fluctuations  are  free  to  grow.  Thus  began  the  present  era,  known  as 
the  matter  dominated  era. 


The  last  important  event  before  structure  formation  came  into  full  swing  occurred 
about  300  thousand  years  after  the  big  bang.  At  this  time  the  universe  consisted  of 
photons  (which  no  longer  dominated),  neutrinos  (which  had  been  freely  streaming 
since  the  era  of  nucleosynthesis  due  to  their  very  weak  interactions),  hydrogen  and 
helium nuclei (positively charged, as the universe had always been too hot for electrons
to join with them into neutral atoms), and electrons (negatively charged lightweight
particles waiting to combine with the nuclei). At about the 300 thousand year
mark,  the  temperature  had  lowered  to  about  3000  K,  low  enough  for  the  electrons  to 
safely  combine  with  the  nuclei,  thus  forming  neutral  atoms.  This  transitional  period 
is  traditionally  called  “decoupling.”  Before  this,  matter  and  radiation  were  closely 
coupled  (and  in  fact  were  at  the  same  temperature)  because  of  the  frequent  photon- 
electron  interactions.  After  the  neutral  atoms  form,  these  scattering  interactions  are 
effectively  silenced.  At  this  point,  the  universe  is  transparent  to  the  hot  photons,  and 
they  travel  freely  for  the  rest  of  time.  This  is  the  faint  radiation  we  are  presently 
detecting  from  all  directions  of  the  sky.  The  photons  were  thermalized  early  in  the 
universe,  and  so  the  radiation  emitted  during  the  decoupling  period  was  blackbody 
radiation  corresponding  to  a temperature  of  about  3000  K.  The  expansion  of  space 
in  the  intervening  10  billion  years  or  so  has  simply  redshifted  the  light  to  longer 
wavelengths,  corresponding  to  a photon  temperature  of  2.7  K. 
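
The amount of stretching implied by the cooling from 3000 K to 2.7 K follows from the fact that a blackbody spectrum remains a blackbody under expansion, with its temperature falling in inverse proportion to the stretching of wavelengths. This scaling is a standard result which we assume here rather than derive; the function name is our own.

```python
def stretch_factor(emitted_temperature_k, observed_temperature_k):
    """Factor by which wavelengths have been stretched, assuming T scales as 1 / wavelength."""
    return emitted_temperature_k / observed_temperature_k


factor = stretch_factor(3000.0, 2.7)  # temperatures quoted in the text
redshift = factor - 1.0               # conventional redshift z

print(f"wavelengths stretched by a factor of about {factor:.0f}")
print(f"corresponding redshift z of about {redshift:.0f}")
```

In other words, every distance in the universe at the time of decoupling was roughly a thousand times smaller than the corresponding distance today.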

The final piece in the puzzle which authenticates the paradigm of big bang structure
formation was discovered by NASA’s COBE satellite in 1992.10 Very soon after
entering orbit and starting operations, the satellite observatory was able to dramatically
update the accuracy to which we knew that the cosmic background radiation
was exactly a blackbody radiation. After some more years of data taking and processing,
COBE was able to detect slight fluctuations in temperature (1 part in 100,000)
10 COBE stands for COsmic Background Explorer.


between  various  parts  of  the  sky.  Recall  that  the  fluctuations  in  matter  density  had 
started to grow ever so slightly by the time decoupling occurred. These small fluctuations
left a statistical imprint on the sea of radiation just before it decoupled.
Cosmologists  had  been  expecting  the  experimental  physics  groups  to  detect  these 
temperature  fluctuations  for  many  years.  But  the  measurements  had  not  been  quite 
sensitive enough. The data are quite impressive in the sense that the experimenters
detected  some  cosmic  noise  superimposed  onto  their  detector’s  instrumental  noise. 
There  was  a big  hubbub  in  the  news  media  at  this  time  (April,  1992)  because  this 
event demonstrated the existence of some sort of statistical structure in the fluctuations
of the truly ancient cosmic radiation.

The  question  naturally  arises:  Where  did  these  fluctuations  originally  come  from? 
One  very  conservative  approach  is  to  simply  sidestep  the  question.  In  order  to  provide 
more  focus,  we  narrow  the  discussion  so  as  to  consider  just  the  growth  of  fluctuations 
since  decoupling,  and  we  treat  the  statistical  characterization  of  the  early  fluctuations 
as  initial  conditions.  This  is  a standard  approach.  Another  approach,  common  since 
the  early  1980s,  has  been  to  extrapolate  the  laws  of  physics  back  into  the  very,  very 
early  universe  in  order  to  seek  a deeper  understanding  of  the  basic  initial  conditions 
of  the  big  bang  model.  The  large-scale  structure  problem  in  the  simplest  sense  is 
concerned only with the growth of small fluctuations into present day structures. Such
growth  transpires  via  gravitational  instability  only  after  the  universe  has  entered  the 
matter  dominated  era.  The  COBE  satellite  data  provide  us  with  a direct  link  to  these 
fluctuations.  That  is  why  the  decoupling  transition  provides  a natural  boundary  for 
the  large-scale  structure  problem. 


Structures


It is not possible to get a picture of large-scale structure by simply looking through
a telescope.  The  distribution  of  matter  has  to  be  mapped  out,  piece  by  piece.  Such 
mapping  has  been  going  on  for  over  fifty  years,  and  only  during  the  last  twenty  years 
or  so  has  a reasonable  picture  of  cosmic  structure  begun  to  accumulate  in  the  data. 

There  are  about  20  galaxies  within  about  15  million  light-years  of  us.  This  cloud 
of  loosely  knit  galaxies  can  be  thought  of  as  a somewhat  isolated  gravitational  system, 
called  the  local  group.  The  Milky  Way  and  Andromeda  are  its  two  largest  galaxies;  the 
other  galaxies  are  merely  satellites  or  dwarfs.  About  half  of  these  galaxies  (including 
Andromeda)  possess  blueshifts  rather  than  redshifts,  because  the  Doppler  shift  is 
mostly  showing  their  local  movements  rather  than  the  Hubble  flow.  If  we  do  not 
consider  our  neighboring  galaxies,  then  we  can  say  that  all  galaxies  for  which  we 
have  Doppler  measurements  are  redshifted,  which  means  that  they  all  are  at  least 
approximately  following  the  outward  Hubble  flow. 

Beyond the local group, many clusters containing 10-100 galaxies have been observed.
During the 1950s, the Palomar Sky Survey data were used to catalogue
thousands of relatively bright clusters.11 These clusters were indicative of a lumpy
overall distribution of galaxies, but the data were still too sparse to draw definite
conclusions as to the nature of clustering. Interestingly, even in the 1930s, as such
structures were first being observed, some astronomers (most notably Fritz Zwicky)
were  suggesting  that  a bit  of  extra  dark  matter  must  be  present  to  gravitationally 
bind  these  clusters. 

Also  during  the  1950s,  it  was  proposed  that  we,  along  with  perhaps  thousands 
of nearby galaxies, were part of a so-called local supercluster. This innovative idea
11 The standard references are to Abell (1958) and Zwicky et al. (1961-68).


was  formulated  by  de  Vaucouleurs  (1953),  but  the  idea  of  such  a large  structure  met 
with some resistance, and only in the late 1970s were skeptics finally convinced that
the  local  supercluster  really  existed.  It  was  difficult  to  extract  structural  information 
from  the  data  before  the  1970s,  largely  because  redshift  measurements  were  difficult 
and  therefore  scarce.  As  it  turned  out,  a large,  reliable  data  base  of  galactic  redshifts 
would  be  a crucial  element  in  the  mapping  of  cosmic  structure. 

To  describe  the  3-dimensional  distribution  of  galaxies  in  space,  we  need  detailed 
information  on  the  position  of  each  galaxy.  If  we  specify  the  right  ascension  and 
declination,  we  have  only  pinpointed  a galaxy’s  location  on  the  celestial  sphere.  To 
complete the picture, we need to know its distance from us. Direct distance determinations
using standard candles are few and far between, and as we noted earlier, the
numbers obtained are still a bit unreliable. Recall that Hubble’s law says that the
recession  velocity  of  a galaxy  is  proportional  to  its  distance  from  us.  This  is  the  saving 
grace  of  observational  cosmology,  for  we  are  able  to  use  the  measured  recession  velocity 
as  an  approximate  indicator  of  distance.  Over  the  years,  observational  astronomers — 
who  are  an  interesting  mix  of  theoreticians  and  experimental  physicists — have  been 
able to provide newer and faster techniques for various measurements, including fundamental
redshift determinations. Fancy electronics and computers have been the
norm  for  some  time  now  in  astronomical  data  acquisition,  and  since  the  mid  1980s, 
improvements  in  the  redshift  data  base  have  been  significant. 

If  we  have  angular  positions  and  redshifts  for  a sample  of  galaxies,  then  we  are 
able  to  formulate  a picture  of  their  spatial  distribution.  The  redshift  is  a direct 
measurement  of  the  recession  velocity  in  the  following  sense.  If,  for  example,  the 
pattern  of  spectral  lines  for  a particular  galaxy  is  shifted  by  1%,  then  its  apparent 
recession  velocity  is  3,000  kilometers  per  second  (which  is  1%  of  the  speed  of  light). 
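
The conversion from a measured redshift to an apparent recession velocity, and from there to an approximate distance via Hubble's law, can be sketched as follows. The relation v = cz is the low redshift approximation used in the example above. The value of the Hubble constant is not given in the text; the 70 kilometers per second per megaparsec used here is purely an assumed illustrative value, and the resulting distance scales inversely with it.

```python
SPEED_OF_LIGHT_KM_S = 299_792.458
ASSUMED_HUBBLE_CONSTANT = 70.0   # km/s per megaparsec; illustrative assumption only
LIGHT_YEARS_PER_MPC = 3.2616e6   # light-years in one megaparsec


def recession_velocity_km_s(redshift):
    """Apparent recession velocity v = c z, valid only for redshifts much less than 1."""
    return SPEED_OF_LIGHT_KM_S * redshift


def hubble_distance_mly(velocity_km_s, hubble_constant=ASSUMED_HUBBLE_CONSTANT):
    """Approximate distance in millions of light-years from Hubble's law d = v / H0."""
    distance_mpc = velocity_km_s / hubble_constant
    return distance_mpc * LIGHT_YEARS_PER_MPC / 1e6


velocity = recession_velocity_km_s(0.01)  # spectral lines shifted by 1%
distance = hubble_distance_mly(velocity)

print(f"recession velocity: {velocity:.0f} km/s")
print(f"approximate distance: {distance:.0f} million light-years")
```

Because galaxies also have local motions of a few hundred kilometers per second superimposed on the Hubble flow, distances obtained this way are only approximate, particularly for nearby galaxies.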



“nearby,” i.e. within 140 million light-years of us, and relatively bright. The figure
is drawn from the perspective of a far away observer who sees the local supercluster
pancake from an edge on view; the pancake edge is horizontal in this figure. As a
technical note, we point out that in Figure 1-4 the dots are plotted using the SGY and
SGZ coordinates from de Vaucouleurs’ supergalactic X, Y, Z coordinate system. These
coordinates  were  specifically  chosen  so  that  the  SGZ  axis  would  be  perpendicular  to 
the  pancake.  On  the  lower  right  part  of  the  pancake,  the  Virgo  Cluster,  the  nearest 
significant  cluster  of  galaxies,  is  apparent. 

Figure  1-4  illustrates  an  important  aspect  of  our  view  of  the  universe.  Note  that 
there  are  few  dots  plotted  in  the  two  30  degree  wedges  in  the  upper  and  lower  parts  of 
the  figure.  This  is  because  we  make  astronomical  observations  from  within  the  thin 
disk of our Milky Way, which contains some obscuring dust. Within about 15 degrees
of the galactic plane, there is essentially a zone of avoidance for extragalactic
observations. 

What do things look like when we probe far beyond the local supercluster? We
can  get  an  idea  of  what  the  very  distant  universe  looks  like  by  examining  a few 
images  obtained  and  processed  by  the  electronic  instruments  associated  with  the 
Hubble  Space  Telescope.  Perusing  such  images  may  help  us  appreciate  the  nature 
of  deep  space  data.  But  to  get  a feel  for  the  nature  of  large-scale  structure  we  will 
look  at  maps  which  were  produced  from  the  compilation  and  analysis  of  very  many 
astronomical  images. 

When  we  analyze  stars  or  galaxies  through  a telescope,  we  need  a quantitative 
way  to  characterize  how  bright  or  dim  the  objects  appear  to  us.  To  do  this,  we 
customarily  extend  the  system  developed  by  the  ancient  Greeks,  who  divided  the 
night  sky  stars  into  six  classes  of  brightness.  The  brightest  stars  were  classified  as 
first magnitude, and classes were equally divided down to the dimmest stars that could
be seen with the naked eye, and these were dubbed 6th magnitude. In modern times,
we  keep  track  of  the  amount  of  electromagnetic  radiation  deposited  on  the  telescope’s 
photographic  plates  or  electronic  detectors.  Because  of  the  great  light  gathering 
capabilities  of  telescopes,  and  because  modern  instruments  are  sensitive  to  levels  of 
radiation  millions  of  times  dimmer  than  humanly  detectable,  we  may  see  deep,  faint 
images  of  distant  galaxies.  Human  eyes  are  logarithmically  sensitive,  and  it  turns 
out  that  the  six  magnitudes  of  brightness  (which  we  still  call  1st  through  6th),  which 
describe  nearly  all  the  night  sky  stars,  actually  correspond  to  electronically  detected 
radiation  energies  spanning  a factor  of  100.  We  continue  this  system  in  the  sense  that 
the  higher  the  magnitude  of  a star  or  galaxy,  the  dimmer  is  the  radiation  we  receive; 
and  an  increase  of  5 magnitudes  always  corresponds  to  a decrease  by  a factor  of  100  in 
the  amount  of  radiation  received.  The  thousands  of  galaxies  in  the  local  supercluster 
are too dim to see with the naked eye. Most of them have apparent brightnesses
somewhere  in  the  range  of  9th  to  14th  magnitude  (note  that  an  11th  magnitude 
galaxy  is  a hundred  times  too  dim  to  see  with  the  unaided  eye).  The  Automatic  Plate 
Measuring  (APM)  galaxy  survey  contains  about  2 million  galaxies  brighter  than  the 
19th  magnitude,  and  therefore  this  survey  probes  far  beyond  the  local  supercluster.  In 
fact,  the  APM  survey  can  detect  a galaxy  more  than  1000  million  light-years  distant. 
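
The magnitude rule described above (five magnitudes correspond to a factor of 100 in received radiation) can be written as a one-line formula. The sketch below is only an illustration; the function name is hypothetical, and the sample magnitudes are the ones mentioned in the text.

```python
def flux_ratio(mag_faint, mag_bright):
    """How many times less radiation we receive from the fainter object.

    Five magnitudes correspond to a factor of 100, so one magnitude is a
    factor of 100 ** (1 / 5), about 2.512.
    """
    return 100.0 ** ((mag_faint - mag_bright) / 5.0)


# 1st versus 6th magnitude: the naked-eye range spans a factor of 100.
print(flux_ratio(6, 1))
# An 11th magnitude galaxy compared with the 6th magnitude naked-eye limit.
print(flux_ratio(11, 6))
# The 19th magnitude APM limit compared with the naked-eye limit.
print(f"{flux_ratio(19, 6):.3g}")
```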

The  Coma  cluster  of  galaxies  is  considered  to  be  a fairly  typical  large,  regular 
cluster. This cluster is so named because all of its galaxies appear near the
constellation Coma Berenices on the celestial sphere. There are a few thousand galaxies with
similar  redshifts  and  angular  coordinates,  which  would  seem  to  indicate  that  they  all 
form  a fairly  tight  group.  Further  analysis  of  the  system  (e.g.  dynamical  analyses, 
independent  distance  determinations)  confirms  this.  The  Coma  cluster  galaxies  have 
recession velocities near 7000 kilometers per second, which means that they are all
about 350 million light-years distant. The galaxies within the core of the cluster lie
within  one  degree  of  each  other  on  the  celestial  sphere. 

As  a bit  of  galactic  trivia,  we  note  that  the  core  of  the  Coma  cluster  lies  very 
near  the  North  Galactic  Pole.  Our  navigational  North  Star,  Polaris,  lies  near  the 
North  Celestial  Pole  defined  by  the  Earth’s  spin  axis.  The  North  Galactic  Pole  is 
analogously  defined  by  the  Milky  Way’s  spin  axis.  Because  the  Earth's  spin  axis 
wobbles  (the  axis  completes  one  precession  every  25,000  years),  Polaris  will  not  be 
the best “North Star” in a few thousand years. But even a million years from now,
the  Coma  cluster  will  still  be  an  excellent  indicator  of  the  North  Galactic  Pole. 

The  recently  repaired  Hubble  Space  Telescope  (HST)  has  provided  some  of  the 
finest  deep  images  yet  obtained.  We  display  two  HST  images,  drawn  from  recent 
press  releases.  The  first,  Figure  1-5,  shows  a clear  image  of  two  Coma  cluster  galaxies 
(350  million  light-years  distant)  along  with  some  background  galaxies  far  beyond. 
Figure  1-6  shows  a sequence  of  state  of  the  art  faint  images  of  distant  galaxies,  some 
of  them  billions  of  light-years  distant. 

In  Figure  1-5,  we  see  a giant  elliptical,  13th  magnitude  galaxy,  known  as  NGC  4881, 
in  the  center  of  the  image.  To  its  right  is  a spiral,  16th  magnitude  galaxy.  Both  of 
these are members of the Coma Cluster, although they reside about a third of a
degree from the cluster center. The image is fairly narrow, only a few arc minutes across
(NGC 4881 has an angular diameter of about 1 arc minute), and so does not include
the many other cluster members. Aside from a few foreground stars (from our Milky
Way), the other objects in the background are galaxies far beyond the Coma Cluster.
The  figure  has  an  odd  shape  since  it  is  actually  a graft  of  four  images,  three  large 
squares  and  one  smaller  square  in  the  center,  from  the  three  Wide  Field  Cameras 
and  the  one  Planetary  Camera  aboard  the  orbiting  telescope.  Even  with  the  very 
sensitive  electronics  used,  this  image  required  16  exposures,  each  lasting  15  minutes. 



In  a couple  of  the  large-scale  maps  we  will  show  shortly  (Figures  1-7  and  1-9),  the 
Coma  Cluster  will  appear  in  its  entirety. 
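
To get a feel for the physical sizes implied by these angles, one can use the small-angle rule: transverse size is roughly distance times angle in radians. The sketch below is only an estimate under that assumption (it ignores cosmological corrections, which are small at the distance of Coma); the function name is hypothetical, and the inputs are the distance and angles quoted in the text.

```python
import math


def transverse_size_ly(distance_ly, angle_arcmin):
    """Small-angle estimate: size = distance * angle, with the angle in radians."""
    angle_rad = math.radians(angle_arcmin / 60.0)
    return distance_ly * angle_rad


COMA_DISTANCE_LY = 350e6  # distance to the Coma cluster quoted in the text

# NGC 4881, about 1 arc minute across.
print(f"NGC 4881: about {transverse_size_ly(COMA_DISTANCE_LY, 1.0):.2g} light-years")
# The cluster core, about 1 degree (60 arc minutes) across.
print(f"Coma core: about {transverse_size_ly(COMA_DISTANCE_LY, 60.0):.2g} light-years")
```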

To compare this image with the six galaxies previously shown in Figures 1-1 and
1-2, note that those six galaxies were of the 12th or 13th magnitude, were three to
five times closer than the Coma Cluster, and had angular diameters of around 3 arc
minutes.

As  we  look  at  ever  more  distant  objects,  we  are  probing  further  into  the  past. 
Nearby galaxies (closer than, say, 100 million light-years) are viewed as our
contemporaries, as the light we receive from them was emitted less than 100 million years ago.
Galaxies which are 1000 million light-years distant (meaning that we are receiving
light emitted a billion years ago) appear to be similar to nearby galaxies, although
there can be signs of galactic evolution in their spectroscopic data. The first real
visual impression of galactic evolution was seen in the high resolution images of faint,
distant galaxies obtained by the Hubble Space Telescope, as shown in Figure 1-6 from
an HST press release (Dressler et al. 1994).

As we go from left to right in Figure 1-6, each column shows images from
successively earlier epochs. The two galaxies in the left hand column of Figure 1-6 are
nearby. Their recession velocities are less than one percent of the speed of light, and
so they represent our contemporaries. The three galaxies in the second
column  have  recession  velocities  near  30%  of  the  speed  of  light,  and  we  are  viewing 
them  when  the  universe  was  about  two  thirds  its  present  age.  The  third  column 
shows  galaxies  when  the  universe  was  about  one  third  its  present  age;  the  recession 
speeds  are  roughly  equal  to  the  speed  of  light.  The  last  column  contains  galaxies  with 
recession  velocities  over  300%  of  the  speed  of  light;  these  are  some  of  the  most  distant 
galaxies  currently  resolvable.  The  distinction  between  ellipticals  and  spirals  becomes 
quite blurry, although the image in the top right has the spectroscopic profile of a
mature elliptical. Analyses of deep images are beginning to reveal some of the secrets
of  how  galaxies  form,  evolve,  and  even  merge  with  each  other. 

1.5  Mapping  the  Large-scale  Structure 

In  order  to  make  headway  into  the  large-scale  structure  problem,  we  do  not  have 
to  probe  nearly  as  far  back  as  the  faint  images  in  Figure  1-6.  The  current  maps 
of  the  large-scale  galaxy  distribution — we  show  a few  such  maps  below — provide  a 
significant,  though  still  incomplete,  glimpse  of  the  large-scale  structure  picture.  The 
COBE  cosmic  background  radiation  fluctuation  data  provide  important  constraints 
on the clumpiness of the ancient universe. Since radiation and matter were coupled in
the  radiation  era  of  the  early  universe,  the  smallness  of  the  background  fluctuations 
detected  by  COBE  implies  that  the  distribution  of  matter  was  not  very  clumpy. 
If  we  consider  both  the  current  galaxy  maps  and  the  constraints  provided  by  the 
cosmic  background  radiation  data,  then  it  would  appear  that  a reasonably  fair  or 
statistically  representative  sample  of  galaxies  can  be  obtained  using  galaxies  within 
about  1500  million  light-years  (i.e.  with  recession  velocities  less  than  about  10%  of  the 
speed  of  light).  While  there  are  millions  of  galaxies  this  close  to  us,  it  is  a technically 
difficult  matter  to  obtain  large  samples  of  galaxies  where  the  samples  are  in  any  sense 
complete  or  thorough. 

The first large-scale mapping project was carried out by Shane and Wirtanen during
the 1960s. By visually analyzing photographic plates from the Lick Observatory,
they  were  able  to  tabulate  the  angular  positions  of  about  a million  galaxies  on  the 
celestial  sphere.  The  survey  plates  which  they  inspected  included  galaxies  down  to 
a dimness  of  about  the  18th  magnitude.  Galaxies  come  in  various  sizes  and  intrinsic 
luminosities — we  say  "intrinsic"  to  distinguish  the  luminosity  (how  much  radiation  a 
galaxy actually emits) from the directly observed apparent brightnesses (how much
radiation we detect far away). Since we have some statistical information concerning
the distribution of galactic luminosities, we can fill in the scientific details of the
following truism: surveys which include dimmer galaxies are probing deeper into the
universe. Since the Lick catalog includes galaxies down to the 18th magnitude, we
can  estimate  that  it  contains  galaxies  out  to  a depth  of  about  900  million  light-years. 

To collect the data for the Lick Observatory “million galaxies” map, part of which
is shown in Figure 1-7, Shane and Wirtanen analyzed more than one thousand slightly
overlapping photographic plates, where each plate was 6° x 6° in size. Each plate was
divided into 1° x 1° squares for data analysis, and each square was further divided
into 36 cells, 10 arc minutes on a side. As it turned out, on average there were
about  1.5  galaxies  per  cell,  with  roughly  one  third  of  the  cells  having  no  galaxies,  one 
third  having  one  galaxy,  and  one  third  having  two  or  more  galaxies  (about  4000  of  the 
million  or  so  cells  had  10  or  more  galaxies).  Figure  1-7  is  a gray-scale  representation  of 
these  galaxy  counts,  where  white  corresponds  to  no  galaxies,  black  to  several  galaxies. 
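
The cell counts quoted above can be compared with what a completely unclustered catalog would give. The sketch below assumes, purely for illustration, that an unclustered catalog has Poisson-distributed counts with the same mean of 1.5 galaxies per cell, and takes the number of cells to be one million; neither assumption is taken from the original reduction. Under those assumptions, cells holding 10 or more galaxies would be expected only a handful of times, far fewer than the roughly 4000 observed.

```python
import math


def poisson_pmf(k, mean):
    """Probability of exactly k galaxies in a cell for an unclustered catalog."""
    return math.exp(-mean) * mean ** k / math.factorial(k)


MEAN_PER_CELL = 1.5   # average count per cell quoted in the text
N_CELLS = 1_000_000   # assumed round number for "a million or so cells"

p_empty = poisson_pmf(0, MEAN_PER_CELL)
p_single = poisson_pmf(1, MEAN_PER_CELL)
p_two_or_more = 1.0 - p_empty - p_single
# Sum the tail directly; terms beyond k = 40 are negligible for this mean.
p_ten_or_more = sum(poisson_pmf(k, MEAN_PER_CELL) for k in range(10, 40))

print(f"empty cells:        {p_empty:.3f}")
print(f"one galaxy:         {p_single:.3f}")
print(f"two or more:        {p_two_or_more:.3f}")
print(f"expected cells with 10 or more: {N_CELLS * p_ten_or_more:.1f}")
```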

The  raw  data  behind  the  original  publication  (Shane  & Wirtanen  1967)  were 
later reprocessed in the “New Reduction of the Lick Catalog of Galaxies” (Seldner,
Siebers, Groth, & Peebles 1977). This later publication provided the “million galaxy”
picture,  part  of  which  was  electronically  scanned  to  make  Figure  1-7.  Our  reproduced 
image  has  less  resolution  than  the  original  image,  which  has  a 16  shade  gray-scale 
corresponding  to  cell  counts  of  0, 1, 2, . . . galaxies.  But  even  the  original  has  less 
resolution  than  the  10  arc  minute  fineness  of  the  data. 

What  can  we  learn  by  just  looking  at  Figure  1-7?  The  image  seems  to  reveal  some 

At the risk of using an oxymoron, this is structure which is, loosely speaking, random
or statistical. At the same time, it seems obvious that the image is not “random” in
the colloquial sense; the image does not correspond to the picture one would get by
randomly sprinkling flecks of black pepper on a piece of paper.

Figure 1-7. Four hundred thousand galaxies, part of the Lick Observatory “million
galaxies” survey (Shane & Wirtanen 1967). The galaxy counts are plotted in grey-scale:
a black pixel corresponds to many galaxies, a white pixel to no galaxies. This
image shows galaxy positions on a section of the celestial sphere within about 50° of
the North Galactic Pole. Credit: Seldner et al. (1977).

Remember that this picture shows only angular positions on the celestial sphere.
If  there  is  structure  here  (as  discerned  by  scientific  algorithms),  what  kind  of  three- 
dimensional  structure  does  it  correspond  to? 

In 1970, the great Russian scientist Yakov Borisovich Zel'dovich wrote a simple,
brilliant paper entitled “Gravitational Instability: An Approximate Theory for Large
Density Perturbations.” This paper (Zel'dovich 1970) indicated that in an expanding
universe, small perturbations from an initial uniform matter distribution would
collapse asymmetrically, resulting in flattened “pancakes” where the density of matter
was large relative to the mean. Gravitation operates on all scales equally (we say that
gravity is “scale-free”), so without added assumptions concerning initial conditions, it
is not obvious what size the pancakes would be. Is the flattened structure in Figure 1-4
such an object? Is there a flattening in smaller structures such as clouds of galaxies?
Does this pancake instability operate on scales larger than the local supercluster? In
the hierarchy of clustering, do smaller structures containing only 10, 100, or 1000
galaxies fit into the pancaking scheme on scales containing 10,000 galaxies? The
answer to all these questions is a qualified “yes” (qualified because the answer really
is not very clear). It is tempting to look at Figure 1-7 and perceive a “filamentary”
sort  of  substructure  that  runs  throughout,  as  might  result  from  large-scale  pancaking. 

The  most  satisfying  result  of  this  picture  comes  from  statistical  measures  which 
have  no  obvious  visual  interpretation.  We  consider  the  statistical  correlations  (Peebles 
1993)  of  the  galaxy  angular  positions  shown  in  Figure  1-7.  Pick  a galaxy  from  the 
picture.  Then  see  how  many  galaxies  are  0.5°  away  from  it.  Compare  this  with 
how  many  galaxies  we  would  get  if  the  data  were  from  a totally  random  catalog 
(e.g. black pepper on paper). There will be either an excess (positive correlation) or
scarcity (negative correlation) compared to the random catalog. Do this procedure for
each galaxy, and for various angles, e.g. 0.6°, 0.7° etc., and then we have the 2-point
correlations  as  a function  of  angular  separation. 
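
The counting procedure just described can be sketched on toy data. The example below is a minimal illustration, not the reduction used for any real catalog: it assumes a small flat patch of sky (so separations are ordinary planar distances in degrees), builds a hypothetical clustered catalog by scattering points around random centers, and uses the simple estimator "data pairs divided by random pairs, minus one" in each separation bin. All names and parameter values are made up for illustration.

```python
import math
import random


def pair_counts(points, bin_edges):
    """Histogram of pair separations (flat-sky approximation, degrees)."""
    counts = [0] * (len(bin_edges) - 1)
    for i in range(len(points)):
        xi, yi = points[i]
        for j in range(i + 1, len(points)):
            xj, yj = points[j]
            sep = math.hypot(xi - xj, yi - yj)
            for b in range(len(counts)):
                if bin_edges[b] <= sep < bin_edges[b + 1]:
                    counts[b] += 1
                    break
    return counts


def angular_correlation(data, randoms, bin_edges):
    """Estimate w = (DD / RR) * norm - 1 in each separation bin."""
    dd = pair_counts(data, bin_edges)
    rr = pair_counts(randoms, bin_edges)
    nd, nr = len(data), len(randoms)
    # Normalization accounts for catalogs of different sizes.
    norm = (nr * (nr - 1)) / (nd * (nd - 1))
    return [norm * d / r - 1.0 if r > 0 else float("nan") for d, r in zip(dd, rr)]


def toy_clustered_catalog(n_clusters, per_cluster, size_deg, scatter_deg, rng):
    """Hypothetical clustered catalog: Gaussian clumps around random centers."""
    points = []
    for _ in range(n_clusters):
        cx, cy = rng.uniform(0, size_deg), rng.uniform(0, size_deg)
        for _ in range(per_cluster):
            # Wrap positions so every point stays inside the square patch.
            x = (cx + rng.gauss(0, scatter_deg)) % size_deg
            y = (cy + rng.gauss(0, scatter_deg)) % size_deg
            points.append((x, y))
    return points


rng = random.Random(42)
SIZE_DEG = 10.0
data = toy_clustered_catalog(40, 10, SIZE_DEG, 0.3, rng)
randoms = [(rng.uniform(0, SIZE_DEG), rng.uniform(0, SIZE_DEG)) for _ in range(400)]
edges = [0.0, 0.5, 1.0, 2.0, 4.0]
w = angular_correlation(data, randoms, edges)
for lo, hi, value in zip(edges[:-1], edges[1:], w):
    print(f"{lo:4.1f} to {hi:4.1f} deg: w = {value:+.2f}")
```

In this toy catalog the smallest separation bin shows a clear excess of pairs over the random ("black pepper") catalog, while the widest bin is close to zero. Real analyses must also deal with survey boundaries, spherical geometry, and selection effects, which this sketch ignores.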

Here  is  the  important  question.  Do  these  angular  correlations  appear  to  be  a 
projection  of  full  three-dimensional  data  which  are  a realisation  of  a homogeneous 
statistical  process?  That  deserves  a translation.  With  our  concept  of  homogeneity, 
we  are  asking  if  a chunk  of  the  universe  in  one  place,  measuring  (e.g.)  300  million 
light-years  on  a side,  is  statistically  similar  to  another  same-sized  chunk  in  another 
location.  To  put  it  another  way  (still  rather  cryptically),  is  the  concept  of  a unique 
mean  density  of  the  universe  well-defined?  And  does  the  clustering  hierarchy  finally 
end, so that there are such things as “fair samples” of the universe?

The  answer  is  almost  certainly  yes.  We  will  stop  there,  only  pointing  out  that  the 
mathematical  deduction  involves  analyzing  the  angular  correlations  of  various  galaxy 
catalogs  to  various  depths  (where  bright  catalogs  are  shallow,  or  nearby,  and  dim 
catalogs  are  deep)  relative  to  the  million  galaxy  correlations.  After  comparing  the 
correlations  of  the  catalogs  of  different  depths,  we  see  that  the  data  are  consistent 
with  a statistically  homogeneous  universe. 

In the 1980s, a new improved large-scale survey was carried out by a team of
Oxford astronomers using the UK Schmidt Telescope in Australia. This team used the
Cambridge Automatic Plate Measuring system to tabulate the angular coordinates
and magnitudes of galaxies appearing on 185 photographic plates and produce what
is known as the APM survey. The survey region was a large field (several thousand
square degrees, roughly 50° x 100°) near the South Galactic Pole. Galaxies down to
magnitude 20.5 were included, so the APM catalog is about three times deeper than
the Shane and Wirtanen catalog (which included galaxies down to magnitude 18.5).
Figure 1-8 shows a grayscale rendering of the catalog, similar to the map of Figure 1-7.
There is more information in the APM map, and it is harder to see the frothy signal
that seemed to occur in the “million galaxies” map. Note that if we take subsamples
of varying depths of the APM catalog, we find angular correlations corresponding
to a projection of a three dimensional homogeneous statistical process, as described
above in the case of the Lick Observatory survey. Most importantly, this newer map,
culled and analyzed independently, agrees statistically with the Lick survey and with
other shallower surveys. This cross checking is crucial and it helps to demonstrate
that the large-scale structure problem is being appropriately addressed.

Figure 1-8. Two million galaxies from the Automatic Plate Measuring survey. Galaxy
counts on the celestial sphere are plotted in grey-scale, as in Figure 1-7. This equal
area projection covers a contiguous area of 4300 square degrees surrounding the
Southern Galactic Pole. The APM and Lick survey maps show a frothy texture in the
distribution of galaxies, due to gravitational clustering. Credit: Maddox et al. (1990),
Astrophysics Department, Oxford University.
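
The depth comparison between the APM catalog (limiting magnitude 20.5) and the Lick catalog (limiting magnitude 18.5) can be checked with the inverse-square law. The sketch below is a rough estimate only: it assumes galaxies of the same intrinsic luminosity in ordinary Euclidean space and ignores redshift dimming and absorption. The function name is hypothetical.

```python
def relative_depth(limiting_mag_deep, limiting_mag_shallow):
    """Depth ratio for galaxies of equal luminosity, from the inverse-square law.

    A galaxy moved d times farther away appears 5 * log10(d) magnitudes
    fainter, so a magnitude difference dm is a distance factor 10 ** (dm / 5).
    """
    return 10.0 ** ((limiting_mag_deep - limiting_mag_shallow) / 5.0)


# APM (20.5) versus Lick (18.5): a factor of about 2.5 in depth.
print(f"{relative_depth(20.5, 18.5):.2f}")
```

Under these assumptions two magnitudes give a factor of about 2.5 in distance, consistent with the rough statement that the APM catalog is about three times deeper.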

In  the  late  1970s  and  early  1980s,  the  nature  of  the  full  three  dimensional  structure 
of  the  galaxy  distribution  began  to  be  unveiled.  To  make  a three-dimensional  map,  we 
need  the  redshift  of  each  galaxy  in  addition  to  its  angular  position.  Redshift  surveys 
containing  hundreds  of  galaxies  during  this  time  seemed  to  indicate  structures — 
voids  (regions  containing  few  galaxies)  and  chains  (narrow  regions  containing  many 
galaxies)— spanning  surprisingly  large  scales.  Could  the  homogeneous  big  bang  model 
with  gravitational  instability  really  produce  structures  which  span  100  million  light- 
years?  While  not  heretical,  such  notions  of  large  structures  were  doubtful  at  best. 

All this changed in 1986 with the publication of “A Slice of the Universe” by
de Lapparent, Geller, and Huchra, as part of the CfA (Harvard Center for
Astrophysics) survey. Since redshifts are difficult to measure, their survey strategy was to
observe galaxies in a thin 6 degrees of declination but to span a large 135 degrees in
right ascension. They limited the survey to galaxies brighter than magnitude 15.5,
so their survey would not be nearly as deep as the Lick or APM projects. It did not
have to be: it was still plenty of work to catalog a few thousand galaxy redshifts,
and they ended up choosing several “slices” to survey. Their famous first slice is
plotted in Figure 1-9. The distance (obtained from the redshift by Hubble's law)
is  plotted  radially.  Note  that  a redshift  of  5000  km/s  corresponds  to  a distance  of 
about 230 million light-years (and therefore 10000 km/s corresponds to 460 million
light-years). The right ascension is plotted across the top. The six degrees of
declination is thin enough that we consider it to be of negligible thickness, and the “slice”
is simply plotted on the paper with each dot corresponding to one galaxy. There are
1057  galaxies  shown.  The  map  starts  to  thin  out  at  about  10000  km/s,  since  at  that 
distance,  many  galaxies  will  not  appear  bright  enough  to  be  in  the  survey. 
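
The conversion from recession velocity to distance used in this kind of plot is Hubble's law, distance = velocity / H0. The sketch below assumes a Hubble constant of 70 km/s per megaparsec, a value chosen here only because it roughly reproduces the correspondence of 5000 km/s with about 230 million light-years quoted above; it is not a value stated in the text, and the function name is hypothetical.

```python
LY_PER_MPC = 3.2616e6  # light-years in one megaparsec


def hubble_distance_mly(velocity_km_s, h0_km_s_mpc=70.0):
    """Distance in millions of light-years from Hubble's law, d = v / H0.

    The default H0 is an assumed value, not one taken from the text.
    """
    distance_mpc = velocity_km_s / h0_km_s_mpc
    return distance_mpc * LY_PER_MPC / 1.0e6


for v in (5000, 7000, 10000):
    print(f"{v:6d} km/s -> about {hubble_distance_mly(v):.0f} million light-years")
```

Because the distance scales inversely with the assumed Hubble constant, slightly different round numbers for the same velocity simply reflect slightly different assumed values of H0.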

If  the  universe  contained  pancakes  or  frothy,  bubbly  structures  containing  an 
excess  of  galaxies,  then  what  would  a slice  through  those  types  of  structures  look 
like?  In  physics,  we  can  sometimes  let  the  data  speak  for  themselves.  Many  folks 
found  the  data  shown  in  Figure  1-9  to  be  interesting  or  even  disturbing.  If  the  universe 
is  really  homogeneous,  then  why  does  that  map  not  look  homogeneous?  That  map  is 
not yet a “fair sample” of the universe. We need to look at a bigger sample, but bigger
samples are not easy to come by. However, the sample required cannot be arbitrarily
large. If a survey is two or three times bigger than the CfA slice but still seems
completely  inhomogeneous,  then  there  will  be  serious  theoretical  inconsistencies  in 
the  overall  big  bang  picture. 

Before we move on, let us note a couple of geographical (or rather cosmographical)
highlights of these maps. The local supercluster shown back in Figure 1-4 can be
partially seen in Figure 1-9. Right next to the apex of the slice, spanning less than
a centimeter on the diagram, there is a small chain of galaxies which is part of our
local supercluster. Back in Figure 1-5, we saw two highly resolved galaxies which
are part of the Coma cluster. Many of the Coma cluster galaxies are prominently
displayed in Figure 1-9. At recession velocity 7000 km/s and right ascension 12.5
hours (right in the upper torso of the humanoid figure), that crowded blob of galaxies
is the Coma cluster. Looking at the Million Galaxies map of Figure 1-7, we can
pinpoint the Coma cluster if we look carefully. Go to the center of the figure, then
down and right by a centimeter or so. That small blob is the Coma cluster, sitting
at the North Galactic Pole.

Figure 1-9. A slice of the universe. Each of the 1057 dots represents a galaxy from
the CfA redshift survey (de Lapparent et al. 1986). The Milky Way galaxy is not
shown; it would be represented by a dot at the apex of the slice. The recession
velocity of each galaxy is proportional to its distance from us, via Hubble's law. To
get an idea of the distances involved, note that a recession velocity of 10,000 km/s
corresponds to a distance of about 500 million light-years. The galaxies in this survey
are located in a slice which is only 6 degrees in thickness but which carves out a
117 degree swath through space. Dramatic large-scale structure is revealed. Data
provided in electronic format by Margaret Geller.

All of the redshifts which have accumulated over the years are a crucial tool in the
analysis of the large-scale structure problem. But only large, highly regulated surveys
can provide the control necessary for statistical analyses. By 1960, there were about 600
published redshifts. By 1976, there were about 2700 measured redshifts, provided by
various astronomers studying many different regions of interest. The 1990 total was
somewhere around 20 thousand. (These numbers were obtained from Dressler (1994) and
Giovanelli and Haynes (1991).)

Only very recently has a larger sample become feasible. With their electronic
detector innovations, Shectman and his colleagues in the Las Campanas Redshift
Survey team have singlehandedly doubled the redshift database, providing about
26,000 redshifts in six slices. The data were made public by Shectman et al. (1996),
and we plot one of the slices in Figure 1-10. The style of plot is the same as the CfA
slice plot. The recession velocity (labeled cz) is plotted radially out to 45000 km/s,
about three times deeper than the CfA slice. Whereas the CfA slice
covered six degrees of declination centered at 29.5°, this slice is only three degrees
thick, centered at a declination of −12°, so the slices are about forty degrees apart and do
not overlap (even though their right ascensions do overlap).

The Las Campanas slice of Figure 1-10 shows 4071 galaxies. While not as visually
striking as the CfA slice, it appears to show a fair sample, since it contains many
repetitions of structures which are smaller than the slice itself. Perhaps the Las
Campanas slices did not merit the same media attention as followed the CfA slice
of 1986 simply because their results were not as controversial. Cosmologists actually
expected the results of Figure 1-10, whereas they were a bit shocked by the image of
Figure 1-9 when it came out.

Figure 1-10. A recently obtained slice of the universe. Each of the 4071 dots
represents a galaxy from the Las Campanas Redshift Survey (Shectman et al. 1996). This
slice is deeper than the CfA slice and shows a different part of the universe. Note
that a recession velocity (obscurely abbreviated as cz in the Figure) of 40,000 km/s
indicates a distance of about 1.9 billion light-years, from Hubble's Law.

Are  the  redshift  data  consistent  with  each  other  and  with  the  large  angular  maps? 
Yes, although the answer is not necessarily trivially established, due to selection
effects.  Survey  galaxies  are  always  scientifically  selected  for  inclusion,  yet  the  surveys 
are  never  ideal  samples,  and  comparing  surveys  can  be  problematic.  Even  obtaining 
a reliable  result  from  one  survey  can  be  difficult,  as  a particular  result  might  be 
selection-induced  or  at  least  selection-biased.  In  scientific  endeavors,  the  details  are 
often  messy,  and  cosmology  is  no  different  than  other  sciences. 

As a relatively simple technical example, we can compare the 2-point spatial
correlations. We pick a galaxy from the redshift survey. Then we ask how many galaxies
are 10 million light-years, 11 million light-years, etc. from that galaxy. Is there
a positive correlation as compared to a totally random catalog? Continuing in this
manner gives us the 2-point correlations as a function of distance. Within
uncertainties which must be partly selection-induced, the spatial correlations between the two
redshift surveys agree, and they correspond to the equivalent angular correlations as
obtained from the “million galaxies” map. Interestingly, there does not seem to be
anything  in  the  correlation  statistics  alone  that  reveals  (or  would  have  enabled  people 
to  predict,  back  in  the  1970s)  the  bubbly  nature  of  large-scale  structure. 
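
For readers who prefer a formula, the counting procedure above corresponds to the usual textbook definition of the spatial 2-point correlation function. The notation below ($\bar{n}$ for the mean number density of galaxies, $\xi(r)$ for the correlation function, $DD$ and $RR$ for pair counts in the data and in a random catalog of the same size and geometry) is assumed here for illustration and is not introduced in the text itself.

```latex
% Joint probability of finding one galaxy in each of two small volumes
% dV_1 and dV_2 separated by a distance r:
dP = \bar{n}^{2} \left[ 1 + \xi(r) \right] dV_1 \, dV_2

% A simple estimator from pair counts at separation r:
\hat{\xi}(r) = \frac{DD(r)}{RR(r)} - 1
```

A totally random catalog has $\xi(r) = 0$ at all separations; an excess of pairs gives $\xi(r) > 0$ and a scarcity gives $\xi(r) < 0$.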

1.6  Statistics and Simulations

The galaxy maps seem to reveal structures. Statistical measures back up this
intuitive observation. We believe that the distribution of galaxies over large scales is
providing information about all of the matter contained in these large samples of the
universe. There is no guarantee of this, but we do need to assume that the galaxies
are faithful tracers of the dark as well as luminous matter.

Recall  that  there  is  physical  evidence  which  supports  the  big  bang  model— in 
particular,  that  the  universe  was  hotter  and  smoother  in  the  past.  Looking  at  the 
redshift maps in Figures 1-9 and 1-10, we see that there are regions, devoid of galaxies,
where  the  matter  density  is  well  below  the  mean,  and  there  are  clumpy  regions  where 
the  density  is  well  above  the  mean.  Hence,  we  say  that  today  there  are  large  density 
fluctuations,  and  we  may  gather  statistics  concerning  these  fluctuations  (the  2-point 
correlations  mentioned  in  the  previous  section  are  just  one  example  of  the  various 
statistical  measures). 

Gathering  large-scale  structure  statistics  might  be  rather  dry  were  it  not  for  the 
high goals of theoretical physics. The COBE satellite data, as well as other modern
data, say that the density fluctuations were small in the ancient past, in the radiation
era, before there were stars and galaxies. It is natural to attempt to account for the
nature of the density fluctuations and their growth over billions of years using the
basic tools of physics. Zel’dovich, Peebles and many of their physics colleagues have
been  dealing  with  this  problem  for  a while  now.  This  problem  includes  the  thorny 
issue  of  galaxy  formation.  The  connection  between  dynamics  (primarily  gravitational 
instability)  and  the  large-scale  structure  statistics  is  also  part  of  the  problem.  Below, 
we  present  material  related  to  the  dynamics/statistics  question. 

A simplified mathematical model of the gravitational instability problem will
allow us to illustrate some of the features of the large-scale structure problem. Consider
a fluid  in  an  expanding  universe.  The  matter  content  of  the  universe  is  represented 
by  the  fluid  which  fills  up  all  of  space,  and  the  space  itself  is  expanding  according  to 
the  precepts  of  general  relativity.  As  a zeroth  order  approximation,  we  say  that  the 
universe  in  this  model  is  exactly  homogeneous.  The  matter  density  of  the  universe 


is  the  same  at  all  locations  and  that  the  fluid  velocity  is  zero  everywhere.  Such  an 
idealization  is  not  ridiculous,  as  it  approximately  applies  to  the  early  universe.  The 
mean  density  (which  is  the  same  everywhere)  is  decreasing  with  time  simply  because 
the  universe  is  expanding  and  cooling.  Even  though  a typical  fluid  element  has  no 
velocity, it appears to be receding from all other points according to the Hubble law,
since the space itself is expanding.

We  start  off  the  gravitational  instability  problem  by  saying  that  the  initial  density 
is  not  perfectly  uniform.  At  an  early  time,  corresponding  to  the  decoupling  era  some 
ten  billion  years  ago,  we  specify  that  the  density  is  nearly  the  same  everywhere. 
Small  fluctuations  are  imposed  as  initial  conditions.  The  fluctuations  are  random, 
but not totally random: there are spatial correlations in the fluctuations. The fluid is
considered  to  be  self-gravitating — each  fluid  element  is  attracted  to  any  other  element 
by  the  laws  of  gravity.  In  this  model,  as  we  concentrate  on  one  region  of  the  universe, 
we  see  that  the  whole  region  expands  and  that  the  fluid  gathers  together  in  a clustering 
pattern as time goes on. When the model reaches the “present” time, there are large
fluctuations  in  the  density  from  place  to  place,  somewhat  like  the  galaxy  maps. 

Let  us  show  how  some  computer  simulations  represent  the  growth  of  fluctuations. 
We  have  limited  computer  resources,  so  we  cannot  specify  the  density  everywhere  in 
our  simulated  universe.  Instead,  a "large"  region — corresponding  to  an  "observable" 
patch  of  this  fake  universe — millions  of  light-years  in  dimension,  is  chosen.  At  perhaps 
one  or  two  million  locations,  we  specify  the  initial  matter  density  including  small 
fluctuations.  Then  the  fluctuations  are  allowed  to  grow  according  to  the  laws  of 
gravity.  There  are  various  technical  choices  which  must  be  made  to  implement  this 
last  task. 

First,  we  show  some  results  from  a simulation  in  a cubical  volume.  The  initial 
fluctuations were specified at two million locations, and at a later time, when the
fluctuations have grown as the fluid has clustered, a sampling of “galaxies” is taken.
Two  adjacent  slices  from  a simulation  are  shown  in  Figure  1-11.  Note  that  some 
filamentary  structures  in  one  slice  are  also  present  in  the  adjacent  slice.  If  there  is  a 
pancake structure in three dimensions, and we take two slices, this is what we would
expect to see.

As  a final  example,  we  show  some  slices  obtained  from  high  resolution  two- 
dimensional  simulations,  and  we  illustrate  statistical  differences  in  an  intuitive  way. 

The  matter  density  is  randomly  assigned  values  at  one  million  locations  in  a slice. 
Some  jargon  is  helpful  in  understanding  the  figures  that  follow.  We  speak  of  a “density 
fluctuation  or  density  contrast”  of  -0.01  if  the  density  is  1%  under  the  mean.  If 
the  density  at  a particular  place  is  2%  or  200%  over  the  mean,  then  that  location 
has  a density  contrast  of  0.02  or  2,  etc.  Initially,  all  the  density  fluctuation  values 
are  near  zero — the  fluctuations  are  small,  as  they  were  in  the  actual  early  universe. 
As time goes by in the simulation and gravity has had its way with the fluid,
some  matter  may  pile  up  into  a pancake  structure  where  the  density  contrast  is  1 or 
2 (or  more).  If  a region  becomes  extremely  underdense,  then  at  most  it  would  have  a 
negative fluctuation of 100% (a density contrast of -1), corresponding to zero matter
density. 
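
The density contrast jargon can be made concrete with a few lines of Python. This is only an illustrative sketch: the function name `density_contrast` and the sample density values are assumptions introduced here, not quantities taken from the simulations.

```python
import numpy as np

def density_contrast(density):
    """Return the density contrast, density / mean - 1, at each location."""
    density = np.asarray(density, dtype=float)
    return density / density.mean() - 1.0

# Hypothetical densities in arbitrary units; their mean is 1.0.
rho = np.array([0.0, 0.5, 1.0, 1.5, 2.0])
delta = density_contrast(rho)

# An empty location has contrast -1; a location at twice the mean has contrast +1.
print(delta)
```

Because the density cannot be negative, the contrast never drops below -1, while it has no upper bound.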

Figure 1-11. Two slices from a computer simulation. Similar to actual redshift
surveys, each dot corresponds to a galaxy. Since the slices are close to each other,
structures in the top slice carry over into the bottom slice.

Consider the following example which illustrates the “randomness” of the initial
conditions. We perform two experiments. In each, we pick 5000 values of the density
contrast near zero, distributed according to the bell curve. The results are shown in
the two histograms of Figure 1-12. By construction, in each plot, the vast majority of
the fluctuation values are near zero (the histograms seem to be centered on zero). If
we look closely at both histograms, we see that they are different. But it is intuitively
clear that they “have the same statistics,” as they both follow the same bell curve
(the dotted curve in each figure). In all our simulations, we arrange for the initial
fluctuations to be distributed randomly, according to the bell curve.
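
The two-experiment idea is easy to reproduce. The sketch below draws two independent sets of 5000 bell-curve (Gaussian) values and compares them. The width `sigma`, the seeds, and the binning are assumptions chosen for illustration; they are not the values behind Figure 1-12.

```python
import numpy as np

sigma = 0.01       # assumed rms of the initial density contrast (illustrative)
n_values = 5000    # number of values per experiment, as in the text

# Two independent experiments (the seeds are arbitrary, hypothetical choices).
sample_a = np.random.default_rng(seed=1).normal(0.0, sigma, n_values)
sample_b = np.random.default_rng(seed=2).normal(0.0, sigma, n_values)

# Histogram both samples on the same bins.
bins = np.linspace(-4 * sigma, 4 * sigma, 41)
hist_a, _ = np.histogram(sample_a, bins=bins)
hist_b, _ = np.histogram(sample_b, bins=bins)

print("bin counts identical:", np.array_equal(hist_a, hist_b))
print("means:   ", sample_a.mean(), sample_b.mean())
print("std devs:", sample_a.std(), sample_b.std())
```

The individual values differ between the two samples, yet the means and standard deviations agree to within the expected sampling noise, which is the sense in which the two histograms “have the same statistics.”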

For each set of initial fluctuations, we also specify the 2-point spatial correlations.
And as time goes by, the spatial statistics change as matter gets clumped
together. Some simulated slices of the universe will help to show the differences in
spatial statistics. Figure 1-13 shows a gray-scale plot of the matter distribution in
one such simulation, after fluctuations have grown via gravitational instability. Light
colors correspond to low density and dark colors to high density, and the numerical
fluctuation values are given according to the color bar below the slice. Note there
are numerous long pancakes and sparse voids. We did not bother to show a plot
corresponding  to  early  times,  because  the  plot  would  have  been  nearly  all  one  shade 
of  gray,  corresponding  to  a density  contrast  close  to  zero  everywhere. 
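
One common way to impose spatial correlations on bell-curve fluctuations is to filter uncorrelated noise in Fourier space with a chosen power spectrum. The sketch below assumes a power-law spectrum P(k) proportional to k^n on a periodic two-dimensional slice. The function names, the grid size, and the spectral indices are illustrative assumptions, and this is not claimed to be the procedure used for the figures.

```python
import numpy as np

def gaussian_field_2d(n_grid, spectral_index, seed):
    """Gaussian random field on a periodic n_grid x n_grid slice, P(k) ~ k**spectral_index."""
    rng = np.random.default_rng(seed)
    noise = rng.normal(size=(n_grid, n_grid))       # uncorrelated bell-curve values
    kx = np.fft.fftfreq(n_grid)[:, None]
    ky = np.fft.fftfreq(n_grid)[None, :]
    k = np.sqrt(kx**2 + ky**2)
    k[0, 0] = 1.0                                   # placeholder, avoids 0 to a negative power
    amplitude = k ** (spectral_index / 2.0)         # square root of the power spectrum
    amplitude[0, 0] = 0.0                           # remove the mean (k = 0) mode
    field = np.fft.ifft2(np.fft.fft2(noise) * amplitude).real
    return field / field.std()                      # normalize to unit variance

def block_variance(field, block):
    """Variance of the field after averaging over block x block cells."""
    n = field.shape[0] // block
    coarse = field[:n * block, :n * block].reshape(n, block, n, block).mean(axis=(1, 3))
    return coarse.var()

correlated = gaussian_field_2d(128, spectral_index=-2.0, seed=3)   # more large-scale power
uncorrelated = gaussian_field_2d(128, spectral_index=0.0, seed=3)  # no spatial correlations

print("variance on 8-cell scales:",
      block_variance(correlated, 8), block_variance(uncorrelated, 8))
```

Both fields have the same one-point (bell curve) statistics by construction, but the field with the steeper spectrum keeps far more of its variance after smoothing, which is one simple way to quantify “large-scale power.”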

In Figures 1-13 through 1-16 we show four such gray-scale plots of the matter
distribution. All of them have initial fluctuations picked according to the bell curve.
But the spatial statistics generally differ. In fact, two of the plots have the same
spatial statistics. A third plot has statistics close to these two, but there is a
significant mathematical difference. And a fourth plot has statistics which are similar
yet clearly different, even to the eye. As a challenge to the reader, try to pick out the
four plots so described (the answers are in the next paragraph).

The  two  statistically  identical  plots  are  Figures  1-13  and  1-16.  Figure  1-14  looks 
similar to 1-13 but is in fact significantly different.¹⁵ The statistics of Figure 1-15 are
easily seen to be different.¹⁶

¹⁵ The fluctuations in Figure 1-14 have significantly less “large-scale power,” in the

¹⁶ The fluctuations in Figure 1-15 have a different “spectrum” in the large scales, and
there are more small-scale fluctuations.


Figure  1-12.  Two  practically  identical  distributions  of  initial  density  fluctuations. 
Upon  close  inspection,  these  random  histograms  are  different,  but  they  are  both 
realizations  of  the  same  bell  curve  statistical  distribution. 




Figure  1-13.  The  matter  distribution  in  a simulated  slice.  Light  gray  corresponds 
to  the  mean  density  (density  contrast  0).  Higher  density  regions  are  darker.  These 
structures  evolved  from  small  fluctuations  in  the  initial  density. 


Figure  1-14.  The  matter  distribution  in  a second  simulated  slice. 






Figure  1-15.  The  matter  distribution  in  a third  simulated  slice. 




Figure  1-16.  The  matter  distribution  in  a fourth  simulated  slice. 


Figure  1-17.  Two  galaxy  maps,  each  containing  about  1000  galaxies,  from  the  same 
matter  distribution,  shown  in  Figure  1-16.  The  top  map  is  unbiased  with  respect  to 
the  underlying  matter  distribution,  whereas  the  bottom  map  is  biased. 


Taking  Figure  1-16  as  an  example  of  the  dark  plus  luminous  matter  distribution, 
what about the galaxies? We show two galaxy samples in Figure 1-17. There are
about 1000 galaxies in each slice. The spatial statistics in the top slice are quite
similar to the statistics of Figure 1-16, but the statistical indicators are very “noisy,”
since the galaxy distribution is discrete while the overall matter distribution is more
continuous. Finally, the bottom slice in Figure 1-17 looks similar, but the galaxies
are preferentially located in high density regions. The statistics are quite similar
between the two slices, but the bottom slice does have measurable statistical differences
due to the fact that the galaxy distribution is biased with respect to the overall matter
distribution. 
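
The difference between an unbiased and a biased galaxy sample can be sketched with a toy density field. Here galaxies are placed in grid cells with probability proportional to a power of the local density: a power of 1 traces the matter, and a larger power favors dense regions. The lognormal stand-in field, the function `sample_galaxies`, and the choice of power are all assumptions made for illustration, not the bias prescription used for Figure 1-17.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for an evolved matter density slice (hypothetical lognormal values).
density = rng.lognormal(mean=0.0, sigma=1.0, size=(64, 64))

def sample_galaxies(density, n_gal, power, rng):
    """Pick n_gal grid cells with probability proportional to density**power."""
    weights = density.ravel() ** power
    prob = weights / weights.sum()
    cells = rng.choice(density.size, size=n_gal, p=prob)
    return np.unravel_index(cells, density.shape)

unbiased = sample_galaxies(density, 1000, power=1.0, rng=rng)  # traces the matter
biased = sample_galaxies(density, 1000, power=2.0, rng=rng)    # prefers dense cells

mean_unbiased = density[unbiased].mean()
mean_biased = density[biased].mean()
print("mean density at galaxy positions:", mean_unbiased, mean_biased)
```

In this toy model the biased sample sits, on average, in denser cells than the unbiased one, even though both are drawn from the same underlying matter distribution.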

1.7.1  Final  Remarks 

It  is  precisely  because  the  large-scale  structure  problem  is  still  an  epic  “work  in 
progress”  that  we  felt  compelled  to  merely  give  a bit  of  background  and  show  some 
pictures  which  might  convey  the  overall  flavor  of  the  problem.  Even  if  it  is  true  that 
the  big  bang  paradigm  is  correct  and  that  the  gravitational  growth  of  initially  small 
fluctuations  provides  the  correct  framework  for  the  large-scale  structure  problem, 
there  are  many  loose  ends  and  interesting  tidbits  to  keep  scientists  busy  for  many 
years. It is very important to point out that modern data do support both of the
above suppositions. Physicists strive for complete theories, but concerning the
large-scale structure problem we “only” have a generally successful theoretical framework,
although  this  is  quite  impressive. 

We  do  not  wish  to  give  the  impression  that  cosmological  work  is  therefore  vague. 
The experimental observations are as precise as possible. They are just like other
measurements in physics, except that there are no repeat experiments (there is just
one universe) and the sources of data are so far away. The theoretical work tends
to  be  quite  particular.  Based  on  the  data,  is  the  galactic  halo  mostly  made  up  of 
baryons,  or  must  some  of  it  be  non-baryonic?  Large  clusters  of  galaxies  give  off  x-ray 
emissions  indicative  of  hot  material;  what  are  the  details?  The  spatial  statistics  of 
galaxies  are  consistent  with  gravitational  instability,  but  do  they  distinguish  between 
various  theories  within  the  overall  framework?  Do  the  statistics  show  the  signs  of 
a bias  relative  to  the  overall  matter  distribution?  What  is  the  rate  of  the  current 
expansion  of  the  universe?  The  list  of  questions  could  get  very  long. 


The best overall reference on cosmology and large-scale structure is arguably
“Principles of Physical Cosmology” by Peebles (1993). This is a modern technical
reference but is also designed to provide an introduction for physics students. Peebles'
approach is conservative, and he makes explicit note at all points where the discussion
becomes at all speculative. Though perhaps slightly dated, “The Structure and
Evolution  of  the  Universe”  by  Zel'dovich  and  Novikov  (1975,  English  edition  1983)  is 
a real  tour  de  force  which  includes  the  basics  as  well  as  material  which  was  somewhat 
speculative  at  the  time. 

For the layperson, there is Steven Weinberg's “The First Three Minutes” (1977),
which is a challenging read and is widely regarded as scientific writing at its best. Alan
Dressler's “Voyage to the Great Attractor” (1994) discusses the large-scale structure
problem but also shows an inside view of the sometimes soap-opera-like interactions
of scientists. Finally, there are basic books such as “Ancient Light” by Alan Lightman
(1991) which popularize the latest developments in cosmology.

Peebles' “The Large-Scale Structure of the Universe” (1980) is a thorough technical
reference which also provides extraordinarily detailed bibliographic information.
The details and references concerning the history of the large-scale structure problem
are  thoroughly  covered  in  the  two  books  by  Peebles.  Also,  for  historical  perspective, 
one may wish to consult “Selected Works of Yakov Borisovich Zel'dovich, Volume II,
Particles, Nuclei, and the Universe” (1993). Zel'dovich passed away in December of
1987. For political reasons, he was not widely known outside his home country. He
was first allowed to visit the United States in 1980, for a scientific conference. Let
us include the following words by astrophysicist Kip Thorne, from a book review in
Physics Today.

The sweep of Zel'dovich's career was breathtaking. . . . My colleagues and I
know  him  as  arguably  the  world’s  most  influential  astrophysicist  and  cosmologist— 
and  also  as  a creative  genius  in  general  relativity  and  in  quantum  field  theory 
in  curved  spacetime.  Others  revered  his  contributions  to  chemical  physics, 
fluid mechanics, the theory of detonations, elementary particle physics, nuclear
physics and the design of nuclear weapons. . . . This second volume of
Zel'dovich's Selected Works gives some sense of his achievements, breadth,
and power. (Thorne 1994, p. 61)


CHAPTER  2 

COSMIC  STRUCTURE  FORMATION 


2.1 Big Bang Background

The  entire  large-scale  structure  problem  of  cosmology  takes  place  within  the 
framework of an expanding universe model. The background cosmology is generally
relevant to most cosmological questions concerning galaxy formation, large-scale
statistics, and the like. However, the minute details are not necessarily always crucial.
In the simplest circumstances, there may be background parameters (e.g. the
cosmic expansion rate and the deceleration of the expansion) which are passed
down through the large-scale structure calculations. The dependence of cosmic statistical
measures on these parameters is sometimes strong, but sometimes not. At the
least  we  need  to  clarify  our  notation  concerning  the  background  cosmology  for  use  in 
subsequent  calculations. 

The material presented in this chapter is part of the standard big bang literature.
Concise reports concerning the reliability of the standard model and the status
of classical cosmological tests may be found in Peebles et al. (1991) and Sandage
(1992), respectively. We include basic elements of the background model below. In
the remainder of the chapter, we illustrate the basic physics of structure formation,
introduce some of the statistical measures used to describe large-scale cosmic structure,
and show the connection between dynamics and the statistics of the large-scale
matter  distribution.  This  connection  is  made  through  perturbation  theory  applied  to 
the  fluid  equations  in  an  expanding  background. 


The background spacetime is defined by the line element. Using the general
theory of relativity, we write the invariant interval between two points as

$$ ds^2 = g_{\mu\nu} \, dx^\mu dx^\nu , \tag{2.1} $$

where the metric tensor $g_{\mu\nu}$ contains information related to the overall matter and
energy content of the universe. The spacetime coordinates $x^\mu$ can be somewhat
arbitrary. Noting that the universe is expanding and appears to be homogeneous
and isotropic, we simply adopt the cosmological principle which formalizes these
basic assumptions. For the time coordinate $x^0$ we use a universal cosmic time $t$,
measured by any and all observers “at rest” and thus following the Hubble expansion.
Using time-orthogonal coordinates, we write the spatial part of the line element in a
manifestly homogeneous and isotropic form to get¹

$$ ds^2 = dt^2 - a^2(t) \left[ \frac{dx^2}{1 - k x^2} + x^2 \left( d\theta^2 + \sin^2\theta \, d\phi^2 \right) \right] . \tag{2.2} $$

This is known as the Robertson-Walker metric and is reviewed by Kolb and Turner
(1990) and Peebles (1993). The proofs of the generality of the metric and of homogeneity
and isotropy are provided in Weinberg (1972), Hawking and Ellis (1973), and
Misner, Thorne, and Wheeler (1973).

The spatial manifold is described by the cosmic expansion factor $a(t)$ multiplying
a 3-metric for a space of constant spatial curvature. The sign of $k$ in equation (2.2)
is the same as the sign of the spatial curvature. The curvature scalar of the
3-dimensional space is the same at all locations by construction, but it depends on time
and is given by $k/a^2(t)$, so from this expression and equation (2.2) we see that the
overall geometry has a length scale set by $a(t)$. Any coordinate distance $x$ (specified
using integrals over $x$, $\theta$, and $\phi$ for example) is related to physical distance by

$$ r = a(t) \, x \tag{2.3} $$

for $|k x^2| \ll 1$. An arbitrary observer or “galaxy” at fixed $x$, $\theta$, and $\phi$ is actually in
free fall, as a fixed coordinate position is a geodesic of the metric, and therefore a
distance $x$ as in equation (2.3) is called a comoving coordinate.

¹ We set the speed of light $c$ to unity.

The matter and energy content of the background are specified using a perfect
fluid model. The homogeneous energy density $\rho$ depends only on time. The homogeneous
pressure $p$ is assumed to be $p = \nu\rho$, where $\nu \approx 0$ corresponds to nonrelativistic
matter and $\nu = 1/3$ corresponds to ultra-relativistic particles or radiation. Using the
general metric (2.2) in the Einstein equations with a perfect fluid stress-energy tensor
leads to two independent equations:

$$ \left( \frac{\dot a}{a} \right)^2 = \frac{8\pi G}{3} \rho - \frac{k}{a^2} , \tag{2.4} $$

$$ \frac{\ddot a}{a} = -\frac{4\pi G}{3} (\rho + 3p) . \tag{2.5} $$

Newton's gravitational constant is denoted by $G$. If a $\Lambda g_{\mu\nu}$ term is included in the
Einstein equations, then a $\Lambda/3$ is added to the right hand sides of (2.4) and (2.5).
From the stress-energy conservation law, we obtain

$$ \frac{d}{dt}\left( \rho a^3 \right) + p \, \frac{d}{dt}\left( a^3 \right) = 0 , \tag{2.6} $$

which also follows from (2.4) and (2.5). These equations plus a $p(\rho)$ equation of state
describe the $a(t)$ evolution. Note that if $p = \nu\rho$, then

$$ \rho(t) \propto a^{-3(1+\nu)} , \tag{2.7} $$

so $\rho$ scales like $a^{-3}$ for pressureless matter and $a^{-4}$ for radiation.
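
For readers who want the intermediate step, the scaling law (2.7) follows from the conservation law (2.6) in a few lines. The short derivation below assumes only that the pressure is $p = \nu\rho$ with $\nu$ constant in time.

```latex
\begin{align*}
\frac{d}{dt}\left(\rho a^3\right) + \nu\rho\,\frac{d}{dt}\left(a^3\right) &= 0 \\
a^3 \dot\rho + 3(1+\nu)\,\rho\, a^2 \dot a &= 0 \\
\frac{\dot\rho}{\rho} &= -3(1+\nu)\,\frac{\dot a}{a} \\
\rho &\propto a^{-3(1+\nu)}
\end{align*}
```

Setting $\nu = 0$ gives the $a^{-3}$ dilution of pressureless matter, and $\nu = 1/3$ gives the $a^{-4}$ behavior of radiation.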

The redshifts of distant objects such as galaxies provide us with observable information
related to the scale factor $a(t)$. The redshift $z$ is defined as the fractional
change of the observed wavelength $\lambda_0$ relative to the wavelength $\lambda_e$ emitted at the
galaxy:

$$ z = \frac{\lambda_0 - \lambda_e}{\lambda_e} . \tag{2.8} $$

The wavelength $\lambda_e$ is known once the spectral line has been identified, since by the
cosmological principle, distant systems have the same physical laws. According to the
geodesic equation, the momentum of a free particle is proportional to $1/a(t)$. That
is, the wavelength of a photon stretches, along with other cosmological lengths, in
proportion to $a(t)$. Therefore, for an observed redshift $z$,

$$ 1 + z = \frac{a_0}{a(t)} , \tag{2.9} $$

where here, and below, we use a subscript 0 for our present epoch. It is customary
to refer to $cz$ as a recession velocity, in order to make an intuitive correspondence
between this cosmological redshift and the usual motion-induced redshift. We can use
$z$ as an indicator of past epochs. Today corresponds to $z = 0$, the most distant galaxies
and quasars have redshifts of $z \approx 4$, and the last scattering of the cosmic background
radiation at decoupling occurred at $z \approx 1000$.

The rate of expansion is given by the Hubble parameter

$$ H = \frac{\dot a}{a} . \tag{2.10} $$

At the present epoch this is called the Hubble constant $H_0$,

$$ H_0 = 100 \, h \ \mathrm{km \, s^{-1} \, Mpc^{-1}} \approx h / (10^{10} \ \mathrm{years}) . \tag{2.11} $$

Throughout our discussion we use the megaparsec (Mpc) unit,

$$ \mathrm{Mpc} \approx 3.086 \times 10^{19} \ \mathrm{km} \approx 3.26 \times 10^{6} \ \text{light-years} . \tag{2.12} $$

The large uncertainty in the measurable quantity $H_0$ is relegated to the dimensionless
parameter $h$, with a rather large window $0.4 \lesssim h \lesssim 1$. Sandage (1993) and Tully
(1993) summarize some of the recent $H_0$ determinations. The entire distance scale
question is still at issue, and that is why we often refer to a distance as $5 \, h^{-1}$ Mpc
rather than using a favored value such as $h \approx 0.7$ stemming from recent measurements.
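
The unit conversion in equation (2.11) can be verified directly. The sketch below converts $H_0 = 100\,h$ km s$^{-1}$ Mpc$^{-1}$ into a Hubble time $1/H_0$ in years. The function name and the seconds-per-year constant are assumptions introduced for this illustration.

```python
KM_PER_MPC = 3.086e19        # kilometers in one megaparsec, equation (2.12)
SECONDS_PER_YEAR = 3.156e7   # assumed conversion (one Julian year)

def hubble_time_years(h):
    """Return 1/H0 in years for H0 = 100 h km/s/Mpc."""
    h0_per_second = 100.0 * h / KM_PER_MPC    # H0 expressed in 1/s
    return 1.0 / (h0_per_second * SECONDS_PER_YEAR)

for h in (0.4, 0.7, 1.0):
    print(f"h = {h}: 1/H0 = {hubble_time_years(h):.3g} years")
```

For $h = 1$ the Hubble time comes out close to $10^{10}$ years, and it scales as $1/h$, which is why the uncertainty in $h$ propagates into every distance and age quoted in this chapter.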

The density in the $k = 0$ case is called the critical density $\rho_c(t)$, and from the
Friedman equation (2.4), we have

$$ \rho_c = \frac{3 H^2}{8\pi G} . \tag{2.13} $$

The dimensionless density parameter $\Omega(t)$ is defined by

$$ \Omega = \frac{\rho}{\rho_c} . \tag{2.14} $$

The spatial curvature is directly related to the mean density of the universe. Using
the definitions of $H$ and $\Omega$ in equation (2.4) we get $k = a^2 H^2 (\Omega - 1)$, which clarifies
how the flat geometry at critical density $\Omega = 1$ is the boundary between the negatively
and positively curved spaces.

The evolution of $a(t)$ is also directly related to the mean density from equation (2.4).
For $p = \nu\rho$, with a conservative constraint $\nu > -1/3$, integration of equation (2.4)
says that there was a singularity sometime in the finite past and that the
universe will expand forever if $k \le 0$ and that the expansion will someday reverse if
$k > 0$.

In the flat geometry case, solutions for $a(t)$ are given by $a \propto t^{2/[3(1+\nu)]}$. In the
early radiation dominated era, $a \propto t^{1/2}$. In the current matter dominated era (which
began shortly before decoupling), $a \propto t^{2/3}$. In the material that follows, we focus
solely on the matter dominated ($p \approx 0$) case which is the relevant background for the
growth of fluctuations which we study. Since the matter era dwarfs the radiation era
in duration, we can go ahead and use the matter dominated solution to calculate the
current cosmic time, $t_0 = \frac{2}{3} H_0^{-1}$, and write

$$ a(t) = a_0 \left( \frac{t}{t_0} \right)^{2/3} . \tag{2.15} $$


The $a(t)$ solutions for $k \neq 0$ and $p = 0$ (cf. Weinberg 1972 and Zel'dovich &
Novikov 1983) start out (in the limit $t \to 0$) with $a \propto t^{2/3}$ and $\Omega = 1$ but then
diverge. We plot three $a(t)$ solutions in Figure 2-1. For convenience we set the current
($t = t_0$) expansion factor $a_0 = 1$ and we choose $h = 0.5$ in the current $H_0$. The open
($k = -1$) model has $\Omega_0 = 0.1$; its $\Omega$ decreases with time. The closed ($k = 1$) model
has $\Omega_0 = 2$; its $\Omega$ will increase during expansion. The dimensionless deceleration
parameter $q = -a\ddot a/\dot a^2$ is related to the density, using equation (2.5), by $q = \Omega/2$. The
current age is a function of $\Omega_0$ such that $t_0$ is 11 billion years in the closed model,
13 billion years in the flat model, and 18 billion years in the open model.
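
The quoted ages can be checked numerically. For a matter dominated model with no cosmological constant and $a_0 = 1$, the Friedman equation can be rewritten as $dt = da / [H_0 (\Omega_0/a + 1 - \Omega_0)^{1/2}]$, so the age is an integral over $a$ from 0 to 1. The sketch below is an illustration only: the function name, the midpoint-rule integration, and the rounded Hubble time are assumptions, not the procedure used to make Figure 2-1.

```python
import numpy as np

HUBBLE_TIME_GYR = 9.78   # 1/H0 in billions of years for h = 1 (rounded)

def age_gyr(omega0, h=0.5, n_steps=200_000):
    """Age of a matter dominated Friedman model (Lambda = 0) in billions of years."""
    a = (np.arange(n_steps) + 0.5) / n_steps        # midpoints of the interval [0, 1]
    da_dt = np.sqrt(omega0 / a + 1.0 - omega0)      # (da/dt) / H0 with a0 = 1
    h0_t0 = np.sum(1.0 / da_dt) / n_steps           # dimensionless product H0 * t0
    return h0_t0 * HUBBLE_TIME_GYR / h

for label, omega0 in (("open", 0.1), ("flat", 1.0), ("closed", 2.0)):
    print(f"{label}: {age_gyr(omega0):.1f} billion years")
```

With $h = 0.5$ the three models give ages that round to 18, 13, and 11 billion years, in agreement with the values quoted above. The flat case reproduces $t_0 = \frac{2}{3} H_0^{-1}$.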

The Hubble “$\dot r = H r$” law is in place once we have $r = a(t) \, x$, as in equation (2.3),
since this gives $\dot r = H r$. But $\dot r$ is not measured. The cosmological redshift $z$ is the
measurable quantity, and all the Friedman models give, to first order in $z$,

$$ cz = H_0 r . \tag{2.16} $$

For completeness, we record the full equation relating a physical distance $r = a_0 x$ to
a redshift $z$ (Mattig 1958),

$$ H_0 r = \frac{2c}{\Omega_0^2 (1+z)} \left\{ \Omega_0 z + (\Omega_0 - 2) \left[ (\Omega_0 z + 1)^{1/2} - 1 \right] \right\} \tag{2.17} $$

$$ \approx c \left[ z - z^2 \left( \frac{1}{2} + \frac{\Omega_0}{4} \right) + \cdots \right] \quad \text{for } z \ll 1 . \tag{2.18} $$
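
The small-redshift expansion can be checked against the full relation. The sketch below codes the Mattig relation in its standard form, $H_0 r / c = 2\{\Omega_0 z + (\Omega_0 - 2)[(\Omega_0 z + 1)^{1/2} - 1]\} / [\Omega_0^2 (1+z)]$, and compares it with the second-order series $z - z^2(1/2 + \Omega_0/4)$. The function names are assumptions introduced for this illustration.

```python
import math

def mattig_distance(z, omega0):
    """H0 * r / c from the full Mattig relation for a matter dominated model."""
    root = math.sqrt(omega0 * z + 1.0)
    braces = omega0 * z + (omega0 - 2.0) * (root - 1.0)
    return 2.0 * braces / (omega0**2 * (1.0 + z))

def small_z_distance(z, omega0):
    """Second-order expansion of H0 * r / c, valid for z much less than 1."""
    return z - z**2 * (0.5 + omega0 / 4.0)

z = 0.01
for omega0 in (0.1, 1.0, 2.0):
    print(omega0, mattig_distance(z, omega0), small_z_distance(z, omega0))
```

At $z = 0.01$ the two expressions differ only at third order in $z$, so the first-order Hubble law $cz = H_0 r$ is adequate for nearby galaxies regardless of $\Omega_0$.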

In our observable section of the universe, the global background model appears to
be a Friedman model with $0.1 \lesssim \Omega_0 \lesssim 2$. A nonzero cosmological constant could still
be considered a possible addition to the model, but opinions vary in the literature.

Figure 2-1. The expansion factor $a(t)$ vs. $t$ for open ($k = -1$), closed ($k = 1$), and flat
($k = 0$) models. These are matter dominated solutions of the Friedman equation (2.4),
and for illustrative purposes, we chose solutions so that $a(t_0) = 1$ corresponds to the
present time. At $t_0$, we fixed $H_0$ with $h = 0.5$, and we set $\Omega(t_0) = 0.1$ (open) and
$\Omega(t_0) = 2$ (closed). For the flat model, $\Omega = 1$ for all $t$, and $a \propto t^{2/3}$ with $t_0$ equal to
13 billion years.

The present geometry, if not flat, is remarkably close to flat. This can be seen from
the Friedman equation. We use equation (2.4) in the form that says $k = a^2 H^2 (\Omega - 1)$
is a constant in time, combined with the density evolution which says that $\rho a^{3(1+\nu)}$
is also constant in $t$. So the present $\Omega_0$, at redshift $z = 0$, is related to an earlier $\Omega$, at
redshift $z$, by

$$ \Omega^{-1} - 1 = \frac{\Omega_0^{-1} - 1}{(1+z)^{1+3\nu}} . \tag{2.19} $$

For example, if $\Omega_0 = 0.1$, then at decoupling ($z \approx 1000$), the value was $\Omega = 0.99$, and
in the very early universe $\Omega$ had to be finely tuned to a value very close to 1. The fine
tuning seems like an odd coincidence, and this is referred to as the flatness problem in
early universe studies. Incidentally, this indicates that the Einstein-de Sitter model
($k = 0$, $p = 0$, $\Lambda = 0$) is a good approximation of earlier times and therefore it is used
in many calculations.
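
The flatness argument can be made quantitative. If $k = a^2 H^2 (\Omega - 1)$ and $\rho a^{3(1+\nu)}$ are both constant in time, then $\Omega^{-1} - 1$ is proportional to $a^{1+3\nu}$, that is, to $(1+z)^{-(1+3\nu)}$. The sketch below evaluates this relation. The function name is an assumption introduced here, and the default $\nu = 0$ corresponds to the matter dominated case.

```python
def omega_at_redshift(omega0, z, nu=0.0):
    """Density parameter at redshift z, given its present value omega0."""
    inverse_minus_one = (1.0 / omega0 - 1.0) / (1.0 + z) ** (1.0 + 3.0 * nu)
    return 1.0 / (1.0 + inverse_minus_one)

for omega0 in (0.1, 1.0, 2.0):
    print(omega0, omega_at_redshift(omega0, z=1000.0))
```

Even models that differ by a factor of 20 in $\Omega_0$ today are all within about one percent of $\Omega = 1$ at decoupling, and the agreement tightens further at earlier times.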

The inflationary theory, which is a very early universe addendum to the standard
big bang model, generally predicts $\Omega_0 = 1$ to high accuracy. A momentary cosmological
constant inflated $a(t)$ exponentially, causing the $k/a^2$ and (non-vacuum) $\rho$ terms
to be negligible. Inflation also addresses the causality or horizon problem in the initial
conditions of our patch of the universe.

The causal horizon (the sphere within which $ds^2 = 0$ signals are observable) has
radius $d_{hor}$ given by

$$ d_{hor} = a(t) \int_0^t \frac{dt'}{a(t')} , \tag{2.20} $$

and we have $d_{hor} \sim ct$ for the Friedman models (e.g. with radiation or matter). We
often refer to the Hubble radius $c/H \sim ct$ as the “horizon”, and this has the same order
of magnitude as $d_{hor}$ for our simple Friedman models. Since $d_{hor} \propto t$ and $a \propto t^\alpha$
($\alpha < 1$), a region of proper size $d$ (which scales like $a$) lies outside the horizon until
such time as $d_{hor} = d$. Photons in the cosmic background radiation separated by
the horizon distance at decoupling ($z \approx 1000$) reach us today separated by less than
a degree on the sky. The uniformity of the cosmic background radiation indicates
a homogeneity which naively would violate a conventional causal origin. This is a
major puzzle. In the large-scale structure problem, initial conditions are often simply
assumed as given at decoupling, so this issue is put on the back burner. But inflation
does appear to deal with it nicely. The causal structure is reworked during the brief,
accelerated ($\ddot a > 0$), inflationary phase which stretches causally related regions to
scales well beyond our present horizon. While providing for large-scale homogeneity,

lasts  throughout  the  subsequent  Friedman  model  evolution  up  to  matter  domination. 

background radiation photons, is 2.7 K. To be exact, the COBE FIRAS team has
determined $T = 2.726 \pm 0.010$ K (Mather et al. 1993) in an exquisite fit to a Planck

to  pertain  to  the  relic  radiation  of  the  big  bang.  This  background  presently  contains 

the matter and continued their adiabatic cooling (from $T \approx 1$ eV $\approx 10^4$ K) down to
their current 2.7 K. The universal thermal history from the hadron era ($T \approx 100$ MeV)
or even earlier (inflation, baryogenesis), through nucleosynthesis ($T \approx 1$ MeV),
continuing through decoupling is reviewed thoroughly by Peebles (1971, 1993), Zel'dovich
and Novikov (1983), and Kolb and Turner (1990). Since the particle physics astro-

we  choose  to  quote  their  apt  summary  concerning  the  thermodynamic  background  in 
cosmology. 

For  much  of  its  early  history,  most  of  the  constituents  of  the  Universe  were  in 
thermal equilibrium, thereby making an equilibrium description a good approximation.
However, there have been a number of very notable departures
from  thermal  equilibrium — neutrino  decoupling,  decoupling  of  the  background 
radiation, primordial nucleosynthesis, and on the more speculative side, inflation,
baryogenesis, decoupling of relic WIMPs, etc. As we have previously
emphasized, if not for such departures from thermal equilibrium, the present
state of the Universe would be completely specified by the present temperature.
The departures from equilibrium have led to important relics: the
light elements, the neutrino backgrounds, a net baryon number, relic WIMPs
(weakly-interacting massive particles), relic cosmologists, and so on. (Kolb &
Turner 1990, p. 115)

We  next  present  thermodynamic  material  which  helps  to  set  the  context  for  the 
gravitational  instability  problem.  The  stage  is  set  for  structure  formation  during  the 
radiation  dominated  period,  and  so  we  begin  there.  At  this  point,  fluctuations  in 
the  metric  (that  is  to  say,  perturbations  in  the  gravitational  potential)  are  present 
but  not  free  to  grow.  After  the  transitional  period  of  matter-radiation  equality,  some 
of  the  fluctuations  in  the  matter  distribution — weakly  interacting,  nonbaryonic  dark 
matter  in  particular — are  free  to  amplify  according  to  gravitational  instability.  Prior 
to decoupling, pressure forces supplied by the photon radiation are sufficient to counteract
the effect of gravity on the baryons, but after decoupling the baryons too can
participate  in  the  instability  on  cosmologically  interesting  scales.  We  introduce  the 
physics  of  gravitational  instability  in  the  next  section. 

Approximately  three  minutes  after  the  big  bang,  the  radiation  dominated  era  is 
in full effect. The last important period ($t \approx 10^{-2}$ to $10^{2}$ s or $T \approx 10$ to $10^{-1}$ MeV) prior
to this time gave us the annihilation of the $e^\pm$ species (leaving enough electrons to
maintain neutrality with respect to the remaining protons) and the nuclear reactions
which were able to construct Helium nuclei ($\approx 25\%$ abundance) and small amounts
of $^2$D, $^3$He, and $^7$Li from the protons and neutrons, leaving $\approx 75\%$ Hydrogen nuclei
(protons). 

After nucleosynthesis, the equilibrium soup contains photons ($\gamma$), neutrinos ($\nu$),
electrons ($e^-$), and baryons (B). We define $T$ to be the photon temperature. The
(massless) neutrinos are no longer really in equilibrium, having decoupled at $T \approx 1$ MeV,
prior to which their weak interactions had kept them in equilibrium. Also
present is the nonbaryonic dark matter, which we believe to have been in equilibrium
with the photons at some point in its history but which, similarly to the neutrinos,
decoupled from the photons and ordinary matter early on. The equilibrium condition
is not true in the case of axions, which do not freeze out in the thermodynamic sense
but fall into the soup as a result of an arcane phase transition (cf. Kolb & Turner
1990). The energy density $\rho$ of the universe during this era comes from the $\gamma$ and $\nu$


radiation: 


$$\rho = \frac{\pi^2}{30}\, g\, T^4,$$ (2.21)


where $g$ is an effective count of the degrees of freedom due to the number of species and
their spin. The pressure is $p = \rho/3$ due to photons. The second law of thermodynamics
applied to a comoving volume element implies that the entropy density (contributed
primarily by the relativistic species) is


$$s = \frac{\rho + p}{T} = \frac{2\pi^2}{45}\, g\, T^3$$ (2.22)


during equilibrium.² Since the entropy $s a^3$ is constant, we have


$$T \propto a^{-1}$$ (2.23)


for  such  periods.  The  entropy  contributions  from  photons  and  neutrinos  have  been 
separately  conserved  since  neutrino  decoupling. 

The present value of the critical density $\rho_c$ is

$$\rho_c(t_0) \approx 1.88 \times 10^{-29}\, h^2\ \mathrm{g\,cm^{-3}}.$$ (2.24)


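² The $\rho$ and $s$ degeneracy factors are
$$g_\rho = 2 + 2\cdot\tfrac{7}{8}\left(\tfrac{4}{11}\right)^{4/3}\cdot 3, \qquad g_s = 2 + 2\cdot\tfrac{7}{8}\cdot\tfrac{4}{11}\cdot 3.$$

Using equation (2.21) with $T_0 \approx 2.7$ K gives $\Omega_{\rm rad} \approx 2.6 \times 10^{-5} h^{-2}$ as a contribution
to the overall density, but we know the density today is $\Omega_0 \sim 1$ (even with no
nonbaryonic dark matter, we have $\Omega_0 \approx 0.1$). The universe is currently matter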
dominated. Direct observations of starlight from the luminous parts of galaxies give
(from galaxy counts, the galaxy luminosity distribution, and the mass to light ratio)
$\Omega_{\rm lum} \approx 0.01$. Such an estimate uses a mass to light ratio of $\approx 10\,h$ in solar units
($M_\odot/L_\odot$), but dynamical estimates for larger systems, from galaxy halos to clusters
to large-scale dynamics, give larger mass to light ratios (Peebles 1993). That dark
matter exists has long been known, but the surprise is perhaps that the dark matter
seems to comprise much more of the universe than luminous matter. The value of
$\Omega_0$ is not well determined, but estimates tend to give $\Omega_0 \gtrsim 0.1$. Note that big
bang nucleosynthesis calculations (which fare well with observational light element
abundances) place an upper bound on the baryon density. Also, simple baryon-only
models have severe problems related to structure formation. Finally, carrying the
least weight in a strict argument perhaps, inflationary models generally predict that
$\Omega_0 = 1$ to high precision. Hence some, possibly most, of the dark matter must be
nonbaryonic.

Gravitational instability is free to operate in the matter dominated era, as we
illustrate in §2.2.2. We can estimate the era of matter-radiation equality. If we simply
equate the radiation density $\rho_{\rm rad}(t_0)(1+z)^4$ to the matter density $\rho_c(t_0)\Omega_0(1+z)^3$ we
find the era of equality to be $z = 4 \times 10^4\,\Omega_0 h^2$. This transition occurred shortly
before decoupling ($z \approx 1000$).
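
A rough numerical check of the equality estimate can be sketched as follows. The constants are standard CGS values and the function names are illustrative; neither comes from the text. A photon-only blackbody at 2.7 K gives a density parameter of the same order as the $2.6 \times 10^{-5} h^{-2}$ quoted above (the exact figure depends on the adopted temperature and species count), and the equality redshift then follows from the ratio of the matter and radiation density parameters.

```python
# Physical constants in CGS units (standard values, not taken from the text).
RADIATION_CONSTANT = 7.5657e-15   # erg cm^-3 K^-4; photon energy density = a_rad * T^4
SPEED_OF_LIGHT = 2.99792458e10    # cm s^-1
RHO_CRIT_H2 = 1.88e-29            # g cm^-3; critical density for h = 1, equation (2.24)


def omega_photon_h2(temperature_kelvin):
    """Photon density parameter times h^2 for a blackbody at the given temperature."""
    mass_density = RADIATION_CONSTANT * temperature_kelvin**4 / SPEED_OF_LIGHT**2
    return mass_density / RHO_CRIT_H2


def equality_redshift(omega0_h2, omega_rad_h2):
    """Redshift at which rho_rad * (1+z)^4 equals rho_matter * (1+z)^3."""
    return omega0_h2 / omega_rad_h2 - 1.0


if __name__ == "__main__":
    print("photon-only Omega h^2 at 2.7 K:", omega_photon_h2(2.7))
    # Using the radiation density parameter quoted in the text and Omega_0 h^2 = 1.
    print("z_eq for Omega_0 h^2 = 1:", equality_redshift(1.0, 2.6e-5))
```

With $\Omega_0 h^2 = 1$ and the quoted radiation density, the result is close to the $4 \times 10^4$ coefficient given in the text.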

We  end  this  section  with  an  attempt  to  add  a little  historical  perspective.  The 
historical  development  of  physical  cosmology  and  the  large-scale  structure  problem  is 
well  covered  by  Peebles  (1980,  1993).  This  history  is  not  always  well  appreciated,  as 
noted by Fred Hoyle (1993) in an assessment of modern cosmology and its history:
“There is a tendency nowadays to think that cosmology only began as a serious study
in 1965, whereas in fact rather little that is really new has happened since 1965” (Hoyle
1993, p. 693). George Gamow (1948a,b) had worked out essential aspects of the theory
(e.g. background radiation and a globular cluster Jeans mass) decades before Penzias
and Wilson (1965) published their cosmic background radiation discovery. On the
other  hand,  there  really  was  a shift  towards  a more  intense  attack  on  cosmology 
in  general  at  this  time,  as  physicists  became  more  inclined  to  use  a more  rigorous 
approach  to  this  formerly  somewhat  tentative  science.  This  sentiment  is  reflected  by 
Zel’dovich. 


essentially a question of belief. If so, I do believe and shall attempt to draw
all possible conclusions from this belief, comparing these conclusions with the

not  know  of  any  such  contradiction.  (Zel'dovich  1965,  p.  313) 

2.2  Cosmic  Structure 

2.2.1  Basic  Ideas 

We  live  in  a section  of  the  universe  which  appears  homogeneous  and  isotropic, 
when  averaged  with  a large  enough  smoothing  filter.  Adopting  the  cosmological 
principle  allows  us  to  use  a big  bang  background  model  for  the  overall  evolution  of 
the universe, and we consider fluctuations with respect to this background.

In regions smaller than an individual galaxy, we know that the matter distribution
is highly structured: molecules, planets, stars, .... Even if we look at large-scale
maps of the galaxy distribution, as shown in Chapter 1, the distribution is still quite
lumpy.  The  universe  was  quite  uniform  in  the  radiation  dominated  era.  There  were 
no  galaxies,  stars,  or  heavy  elements.  Small  amplitude  fluctuations  did  exist,  perhaps 
as a relic of an inflationary era. These were real fluctuations with a statistical signal,³
we presume, and not simply fluctuations associated with the discreteness of matter
(such "minimal" fluctuations do have their own signal, too small to detect). We
adopt the standard scenario for structure formation whereby the small fluctuations
grew via gravitation into the observed large-scale structure. When we focus on large
scales, gravitational instability is the key ingredient of any model. The details of
hydrodynamics and electromagnetism can never be ignored, but by not considering
galaxy formation per se, cluster or halo composition, etc. we effectively side-step such
issues (in the particular areas which we study, not in the large-scale structure problem
in general).

We now begin to describe deviations from the perfectly homogeneous background
and thus begin our focus on the large-scale structure problem. To get a sense of
the numbers, we use rough orders of magnitude for presently observed values of the
mass density, which we denote by $\rho$. On Earth, we have $\rho \approx 1\ \mathrm{g\,cm^{-3}}$, while for
the Solar System, $\rho \approx 10^{-10}\ \mathrm{g\,cm^{-3}}$. For a galaxy, we make a crude estimate⁴ that
$M \approx 10^{12} M_\odot$ and $d \approx 0.01$ Mpc to arrive at $\rho \approx 10^{-24}\ \mathrm{g\,cm^{-3}}$. The mean cosmic
density is $\bar\rho \approx 10^{-29}$-$10^{-30}\ \mathrm{g\,cm^{-3}}$. On cosmic scales it is convenient to use the
dimensionless density contrast $\delta$,

$$\delta(\mathbf x,t) = \frac{\rho(\mathbf x,t) - \bar\rho(t)}{\bar\rho(t)}.$$ (2.25)

At the present epoch, large structures have fluctuations ranging from $\delta \approx 1$ to $\delta \approx 10^6$,
as we indicate in Table 2-1.

Table 2-1. Density fluctuations for large objects.

Density Contrast    Object                 Size [$h^{-1}$ Mpc]
10^6                Galaxy                 0.01
10^3                Galaxy Cluster         1
few                 Galaxy Supercluster    100
0                   Fair Sample            1000

³ Since the COBE detection of cosmic background radiation fluctuations (Smoot et al. 1992),
we believe we have direct evidence of such a signal.

⁴ The mass of the sun is $M_\odot \approx 2 \times 10^{33}$ g.

As a consistency check, we can estimate that structure formation (galaxies, clusters, ...)
was, in fact, a relatively recent occurrence, e.g. that such structures didn't
form in the radiation era or near decoupling. Since $\bar\rho a^3$ is constant in the matter
dominated era, we can write for earlier times that $\bar\rho = \bar\rho_0(1+z)^3$. Since we have
$\delta \approx 10^6$, $10^3$, and 1 for galaxies, clusters, and superclusters, we can simply trace
$\bar\rho$ back to higher $z$, and we get redshifts of $z \approx 100$, 10, and 1 for the corresponding
mean densities. If such structures had separated out earlier (at higher $z$), they
would clearly have higher densities presently. A conservative, detailed estimate of
the timetable is given by Peebles (1993, §25), and we may take $z \approx 5$ as the galaxy
formation era, and $z \approx 1$-$3$ as the era for larger structures.
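
The trace-back argument can be sketched in a few lines. The function name is illustrative, and the only inputs are the relation $\bar\rho \propto (1+z)^3$ and the density contrasts of Table 2-1; the result is an order-of-magnitude upper limit on the formation redshift, not a detailed timetable.

```python
def formation_redshift(density_contrast):
    """Redshift at which the mean cosmic density equalled an object's present density.

    Assumes rho_mean(z) = rho_mean(0) * (1 + z)**3 and
    rho_object = rho_mean(0) * (1 + density_contrast).
    """
    return (1.0 + density_contrast) ** (1.0 / 3.0) - 1.0


if __name__ == "__main__":
    for name, delta in [("galaxy", 1e6), ("cluster", 1e3), ("supercluster", 1.0)]:
        print(f"{name:12s} delta = {delta:8.0e}  z = {formation_redshift(delta):6.2f}")
```

For a density contrast of exactly 1 the formula gives a redshift well below 1, so the value quoted for superclusters should be read as a rough order of magnitude, consistent with the "few" entry in the table.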

The apparent motion of a galaxy at $\mathbf r = a\mathbf x$ is divided into two pieces, as in

$$\dot{\mathbf r} = \dot a\,\mathbf x + a\,\dot{\mathbf x} = H\mathbf r + \mathbf v.$$ (2.26)

The  Hubble  flow  Hr  corresponds  to  the  geodesic  solution  of  a point  at  rest  in  the 
expanding  background.  The  velocity  v relative  to  the  mean  rest  frame  is  known 
historically  as  the  peculiar  velocity.  The  peculiar  motions  within  our  local  group  and 
our  infall  toward  the  Virgo  Cluster  are  nearby  examples  of  measurable  v values.  The 
"Fingers  of  God"  effect,  easily  seen  in  the  Coma  cluster  in  the  Slice  of  the  Universe, 
Figure  1-9,  results  from  the  fact  that  the  radial  components  of  v for  galaxies  get 
included  in  the  redshift  measurements  and  distort  the  redshift  map  relative  to  a 
pure galaxy position map. In principle, the velocity field $\mathbf v$ is detectable in redshift
surveys (once the distance, e.g. from the Tully-Fisher relation, has been subtracted
out of the redshift) since the radial component of $\mathbf v$ is sufficient to extract the entire
$\mathbf v$ if the velocity field is a gradient (which it is if gravitational instability is operating
in the large). Current $\mathbf v$-field results are preliminary, but the technique will have great
power  since  the  entire  matter  distribution  (whether  luminous  or  not)  is  the  source  of 
the  flow  (Bertschinger  & Dekel  1989).  Equation  (2.26)  also  holds  good  in  our  analysis 
of  the  growth  of  small  fluctuations  in  a fluid  model,  as  v will  be  the  velocity  field 
with  respect  to  the  expanding  background. 

In the fluid model, our notation will resemble that of conventional fluid mechanics.
The density field $\rho(\mathbf x,t)$ depends on space and time, and we use comoving coordinates
$\mathbf x$, so that $\mathbf x$ would be referred to as a spatial coordinate (as opposed to a material
coordinate).⁵ We consider variations from the background $\bar\rho(t)$, and the subsequent
dynamical equations are more conveniently expressed using $\delta(\mathbf x,t)$, the density contrast.
In the equations of motion for the Newtonian approximation, introduced below,
the gravitational potential is connected to $\delta$ via the Poisson equation, and $\mathbf v(\mathbf x,t)$,
$\delta(\mathbf x,t)$, and $\phi(\mathbf x,t)$ are related using the continuity and Euler equations. The Eulerian
approach is more natural in the sense that measurements (e.g. redshift surveys)
supply information related to the fields $\delta(\mathbf x)$ and $\mathbf v(\mathbf x)$ at the current epoch. Later,
particularly in Chapter 3, we have occasion to use the Lagrangian method, where the
separation between uniform expansion and peculiar motion is used directly, in the form

$$\mathbf r(\mathbf q,t) = a(t)\,[\mathbf q + \boldsymbol\Psi(\mathbf q,t)],$$ (2.27)

where material coordinates $\mathbf q$ are specified when fluctuations are small, and the
gravitational instability is described by the function $\boldsymbol\Psi(\mathbf q,t)$ which supplies the deviations
from the Hubble flow and which leads to structure.

⁵ The spatial coordinate formulation of fluid mechanics is known as "Eulerian", while
the equivalent material coordinate formalism is known as "Lagrangian". This terminology
is odd, considering Euler developed both methods prior to Lagrange and
that Lagrange's contributions were primarily to the spatial coordinate method. This
is clarified by Truesdell (1954). In spirit, the names are appropriate (though not
accurate), and we keep with the tradition, as we have occasion to use both methods.

Corresponding to any comoving length scale $\lambda$ is a wavevector $\mathbf k$, $k = 2\pi/\lambda$, in
Fourier transform space. The density contrast is considered as the sum of plane wave⁶
modes $\delta(\mathbf k,t)$,

$$\delta(\mathbf x,t) = \int d^3k\;\delta(\mathbf k,t)\,e^{i\mathbf k\cdot\mathbf x}.$$ (2.28)

The Fourier decomposition proves useful in characterizing the initial density fluctuations
and their evolution. In addition, the current matter distribution, as traced by
galaxies, can be usefully described by statistics in either the $\mathbf x$ or $\mathbf k$ domain. It is
often convenient to normalize the expansion factor $a(t_0)$ to unity, so the comoving
and physical scales coincide at the present epoch. Tracing a length scale backwards
is possible, since the physical scale is $a(t)\lambda$.

2.2.2  Jeans  Instability 

The  Jeans  (1902,  1928)  theory  of  gravitational  instability  is  fundamental  for  a 
basic  understanding  of  the  onset  of  formation.  The  Jeans  instability  is  covered  par- 
ticularly well  in  two  references:  Peebles  (1980),  who  also  covers  its  history  as  applied 
to  cosmology,  and  Zel'dovich  and  Novikov  (1983,  Chapter  10),  who  provide  details 
for  both  a static  and  an  expanding  medium.  We  introduce  some  of  the  physics  below. 

The essence of the idea is most easily illustrated following Jeans' approach for a
collisional fluid in a static background (he intended to apply the results to planet
and star formation within a spiral nebula). The gravitational equations are certainly
scale-free, and so the concepts transfer freely to our problem, though there are two key
reasons why the results do not directly apply to cosmology: the universal expansion
is neglected and the methods are purely Newtonian. The first problem is crucial
and must be corrected for. The second is not really a setback, as the Newtonian
approximation which we ultimately use in our cosmological analysis is an appropriate
limit (subhorizon and slow motion) of the full general relativistic analysis.

⁶ The $e^{i\mathbf k\cdot\mathbf x}$ mode expansion is strictly valid only when the physical scale of the modes
is much less than the radius of curvature of the background geometry. But this is the
case now (even if we are not in an Einstein-de Sitter model) and even more so in the
past. In the Newtonian regime in which we operate, our wavelengths are much smaller
than the horizon, which addresses other caveats concerning such a mode expansion
(Kolb & Turner 1990).

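The fluid equations (continuity, Euler, and Poisson) plus a thermodynamic statement
(dissipationless flow; $S$ is the entropy per unit mass) are

$$\frac{\partial\rho}{\partial t} + \nabla\cdot(\rho\mathbf u) = 0$$ (2.29)
$$\frac{\partial\mathbf u}{\partial t} + (\mathbf u\cdot\nabla)\mathbf u + \frac{1}{\rho}\nabla p + \nabla\Phi = 0$$ (2.30)
$$\nabla^2\Phi = 4\pi G\rho$$ (2.31)
$$\frac{\partial S}{\partial t} + \mathbf u\cdot\nabla S = 0.$$ (2.32)

First take a constant background solution (density $\rho = \rho_b$, velocity $\mathbf u = 0$, etc.) and
add small perturbations, which we may seek as a sum of plane wave modes as in
equation (2.28), yet we write one mode only for simplicity: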


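$$\rho(\mathbf x,t) = \rho_b\left[1 + \delta(t)\,e^{i\mathbf k\cdot\mathbf x}\right]$$ (2.33)
$$\mathbf u(\mathbf x,t) = 0 + \mathbf v(t)\,e^{i\mathbf k\cdot\mathbf x}$$ (2.34)
$$\Phi(\mathbf x,t) = \Phi_b + \phi(t)\,e^{i\mathbf k\cdot\mathbf x}$$ (2.35)
$$S(\mathbf x,t) = S_b + s(t)\,e^{i\mathbf k\cdot\mathbf x}$$ (2.36)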

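$$p(\mathbf x,t) = p_b + v_s^2\,(\rho - \rho_b) + \left(\frac{\partial p}{\partial S}\right)_{\rho}(S - S_b)$$ (2.37)
$$v_s^2 = \left(\frac{\partial p}{\partial\rho}\right)_{S}$$ (2.38)

$$\dot\delta + i\mathbf k\cdot\mathbf v = 0$$ (2.39)
$$\dot{\mathbf v} + i\mathbf k\,v_s^2\,\delta + i\mathbf k\,\frac{1}{\rho_b}\left(\frac{\partial p}{\partial S}\right)_{\rho}s + i\mathbf k\,\phi = 0$$ (2.40)
$$-k^2\phi = 4\pi G\rho_b\,\delta$$ (2.41)
$$\dot s = 0$$ (2.42)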


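Initially $s = 0$, and therefore equation (2.42) implies that $s = 0$. We write $\delta(t) = \delta_0 e^{\omega t}$,
$\mathbf v(t) = \mathbf v_0 e^{\omega t}$, etc. and split $\mathbf v_0$ into one longitudinal (curl-free) and two transverse
components:

$$\mathbf v_0 = (\hat{\mathbf k}\cdot\mathbf v_0)\,\hat{\mathbf k} + \hat{\mathbf k}\times(\mathbf v_0\times\hat{\mathbf k}).$$ (2.43)

(2.44)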

and a non-trivial solution for $\delta_0$, $\phi_0$, and $\mathbf v_0$ exists only if the corresponding determinant
is zero, which gives the following dispersion relation:

$$\omega = \pm\sqrt{4\pi G\rho_b - v_s^2 k^2}.$$ (2.47)

It  is  clear  from  equation  (2.47)  that  there  are  two  types  of  solutions,  as  the  length 
scale  of  the  perturbation  may  be  greater  than  or  less  than  the  critical  Jeans  length. 
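
As a short worked step, assuming the dispersion relation (2.47) with $e^{\omega t}$ time dependence, the boundary between growing and oscillating modes is where $\omega$ vanishes. This gives the standard static-medium Jeans wavenumber and length (the symbols $k_J$ and $\lambda_J$ are the ones used in the surrounding text):

```latex
\omega^2 = 4\pi G\rho_b - v_s^2 k^2 = 0
\quad\Longrightarrow\quad
k_J = \frac{(4\pi G\rho_b)^{1/2}}{v_s},
\qquad
\lambda_J = \frac{2\pi}{k_J} = v_s\left(\frac{\pi}{G\rho_b}\right)^{1/2}.
```

Modes with $k < k_J$ have real $\omega$ and grow; modes with $k > k_J$ have imaginary $\omega$ and oscillate.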


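The unstable solution occurs for small $k$ (scales larger than $\lambda_J$) when $\omega$ is real
and positive, and we may write out the solutions from equations (2.44-2.48).

$$\delta(\mathbf x,t) = \delta_0\,e^{i\mathbf k\cdot\mathbf x}e^{\omega t}$$ (2.49)
$$\mathbf v(\mathbf x,t) = \frac{i\mathbf k}{k^2}\,\delta_0\,(4\pi G\rho_b)^{1/2}\left[1 - (k/k_J)^2\right]^{1/2} e^{i\mathbf k\cdot\mathbf x}e^{\omega t}$$ (2.50)
$$\phi(\mathbf x,t) = -\delta_0\,\frac{4\pi G\rho_b}{k^2}\,e^{i\mathbf k\cdot\mathbf x}e^{\omega t}.$$ (2.51)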

In particular, the $\delta$ and $\mathbf v$ solutions are spatially out of phase by $\pi/2$, so, for example,
a velocity maximum coincides with a zero in $\delta$, as one would expect. Solutions
with $\omega < 0$ are, in fact, solutions, but they are related to particulars of the initial
conditions and nonetheless get swamped by the unstable solutions as $t$ increases.
Clearly the solutions (2.49-2.51) are physical (as opposed to mathematical artifacts),
as it is energetically favorable for the unstable solutions to grow. Such an instability
increases without bound (there is nothing in the linearized equations to stop it),
but actual growth stops when an internal potential/kinetic energy balance is
reached in the separated object, and a bound system such as a star or galaxy is
made. And when there is dissipation, the collapse to a bound object is even deeper
(the gravitational potential is more negative, but the kinetic energy is larger, as per
the virial theorem). Large $k$ modes ($\lambda < \lambda_J$) propagate like a sound wave (exactly
so in the limit $k \gg k_J$) with frequency $v_s k\,[1 - (k_J/k)^2]^{1/2}$. There are two waves, in
the $\pm\mathbf k$ directions, and no instabilities.
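
The two regimes of the static Jeans dispersion relation can be explored numerically with a minimal sketch like the one below. It assumes the relation $\omega^2 = 4\pi G\rho_b - v_s^2k^2$ from equation (2.47); the function names and the CGS value of $G$ are choices made here for illustration, and any density and sound speed passed in are hypothetical inputs.

```python
import math

G_NEWTON = 6.674e-8  # cm^3 g^-1 s^-2 (CGS); standard value, not from the text


def jeans_wavenumber(rho_b, v_s):
    """Critical wavenumber k_J at which omega vanishes."""
    return math.sqrt(4.0 * math.pi * G_NEWTON * rho_b) / v_s


def omega_squared(k, rho_b, v_s):
    """omega^2 from the static-medium dispersion relation."""
    return 4.0 * math.pi * G_NEWTON * rho_b - (v_s * k) ** 2


def classify_mode(k, rho_b, v_s):
    """Return ("growing", growth rate) or ("oscillating", angular frequency)."""
    w2 = omega_squared(k, rho_b, v_s)
    if w2 > 0.0:
        return "growing", math.sqrt(w2)
    return "oscillating", math.sqrt(-w2)


if __name__ == "__main__":
    rho_b, v_s = 1e-24, 1e6  # hypothetical density (g cm^-3) and sound speed (cm s^-1)
    k_j = jeans_wavenumber(rho_b, v_s)
    for factor in (0.1, 0.5, 2.0, 10.0):
        print(factor, classify_mode(factor * k_j, rho_b, v_s))
```

For $k \ll k_J$ the growth rate approaches $(4\pi G\rho_b)^{1/2}$, independent of $k$, and for $k \gg k_J$ the frequency approaches the sound-wave value $v_s k$.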

The qualitative features shown above for a fluid with pressure transfer well to
the cosmological scenario as related to baryons before decoupling, when pressure is
supplied by the hot photon gas. The physical situation is more accurately described
by a collisionless fluid in the matter dominated era for weakly interacting dark matter
(or for baryons after decoupling). Such a model uses a smooth gravitational potential
$\Phi$ and density $\rho$, related by the Poisson equation, describing a sea of noninteracting
particles of mass $m$. The phase space distribution function of the particles is denoted
by $F(\mathbf x,\mathbf u,t)$, and with a phase space trajectory determined by

$$\dot{\mathbf x} = \mathbf u, \qquad \dot{\mathbf u} = -\nabla\Phi,$$ (2.52)

the dynamics are governed by the Vlasov equation,

$$\frac{\partial F}{\partial t} + \nabla\cdot(F\mathbf u) + \nabla_{\mathbf u}\cdot(F\dot{\mathbf u}) = 0.$$ (2.53)

This model is described by Peebles (1980, §9) and Zel'dovich and Novikov (1983,
§10.7), and we show the main ideas, which lie at the foundations of the gravitational
instability problem. What we present below resembles the treatment of the simplest
macroscopic, small amplitude plasma disturbances (Ichimaru 1986, Goldston &
Rutherford  lvso),  since  the  plasma  discussion  also  begins  with  the  Vlasov  equation 
and  leads  to  similar  dispersion  relations — with  crucial  differences  since  the  gravita- 
tional coupling  and  the  inertial  mass  are  the  same  m,  and  gravity  is  always  attractive, 
in  obvious  contrast  to  the  situation  in  a plasma. 

Using similar notation to that in the above collisional analysis, we perturb about
a stationary background solution.⁹ Denoting the perturbations by $f$, $\delta$, $\mathbf v$, and $\phi$, we
write

$$F(\mathbf x,\mathbf v,t) = F_b(v^2) + f(\mathbf x,\mathbf v,t),$$ (2.54)

and the linearized version of equation (2.53) is

$$\frac{\partial f}{\partial t} + \mathbf v\cdot\nabla f - \nabla\phi\cdot\nabla_{\mathbf v}F_b = 0,$$ (2.55)

with relations such as

$$\dot{\mathbf v} = -\nabla\phi$$ (2.56)
$$\nabla^2\phi = 4\pi G\rho_b\,\delta$$ (2.57)
$$\rho_b\,\delta = m\int f\,d^3v.$$ (2.58)

⁹ The same "swindle" involving the Poisson equation is used, but again this problem
does not persist in the case of an expanding background.


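Plane wave solutions ($f = f_0(\mathbf v)\,e^{i\mathbf k\cdot\mathbf x+\omega t}$, $\delta = \delta_0\,e^{i\mathbf k\cdot\mathbf x+\omega t}$, etc.) lead to a
somewhat complicated dispersion relation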


(2.62) 


This is the same gravitational instability shown earlier, in agreement with equation
(2.47) for small $k$. If we set $\omega = 0$ in equation (2.60), we find the critical Jeans
length $\lambda_J = 2\pi/k_J$. Equation (2.60) gives (after integration by parts)

$$k_J^2 = 4\pi G\rho_b\,\langle v^{-2}\rangle.$$ (2.63)

This is similar to the collisional expression (2.48), but now velocity dispersion behaves
like the pressure. The stability at $\lambda < \lambda_J$ is not due to wave propagation, but is due
to free streaming, related to Landau damping in plasma physics.

As Zel'dovich (1965) points out, the nature of the instability in our expanding
fluid model is well illustrated by the physics presented above, though we must include
the important effects of universal expansion when we present the equations of motion
and draw firm conclusions, as we do below. We may gain some insight by crudely
modifying the ideas above to accommodate the fact that the background density
depends on time. For example, let us pick Einstein-de Sitter expansion; then the
Friedmann equation (2.4) gives $a(t) \sim t^{2/3}$ and in particular $\rho_b = (6\pi G t^2)^{-1}$. For $k \ll k_J$
(a very weak constraint in the matter dominated era), our plane wave perturbation
solution (2.49) applies, with $e^{\omega t}$ time dependence, and $\omega = \pm\sqrt{4\pi G\rho_b}$ independent
of $k$ to good approximation, so a density perturbation could be represented as

$$\delta(\mathbf x,t) = e^{\omega t}\int d^3k\;\delta_{\mathbf k}\,e^{i\mathbf k\cdot\mathbf x}$$ (2.64)
$$\phantom{\delta(\mathbf x,t)} = A(t)\,\delta(\mathbf x),$$ (2.65)

with the space and time factors separated. To account for the time dependence,
we simply "integrate" $\dot\delta = \pm\omega\delta$, with $\omega = \sqrt{2/3}\,t^{-1}$, and we find that the time
dependence¹⁰ of a perturbation is $A(t) \sim t^{\pm\sqrt{2/3}}$, so perturbations grow by a power
law, rather than by exponential growth, in an expanding universe. This turns out
to be crucial, as there is a sort of agreement in timing between the instability and
the cosmic expansion: the instabilities do not develop so quickly as to render the
background model useless. This may even be somewhat obvious in hindsight, since
there is no preferred time scale (the instability rate $\omega$ coincides with the expansion
rate $\dot a/a$), so that by dimensional analysis $a(t)$ and $A(t)$ are scale free power laws.
This is not strictly true if, for example, $\Omega_0 < 1$, because there is a curvature scale,
and eventually the instability $A(t)$ is quelled (basically because the universe dilutes
too rapidly) when $z < \Omega_0^{-1}$. Just as important is the fact that the wavelengths of the
perturbations scale like $a(t)$, so that in the strict analysis, the $A(t)\,\delta(\mathbf x)$ split holds
true, but in comoving coordinates $\mathbf x$, so the growth of long-wave perturbations is
independent of $k$ for small comoving $k$, and the perturbation retains its initial shape.

¹⁰ The strictly correct conclusion, shown below, is that the growing and decaying solutions
are $A \sim t^{2/3}$ and $A \sim t^{-1}$.
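
The strictly correct exponents can be checked numerically with a small sketch. It assumes the standard long-wavelength linear growth equation $\ddot\delta + 2(\dot a/a)\dot\delta - 4\pi G\rho_b\,\delta = 0$, which for Einstein-de Sitter expansion ($\dot a/a = 2/(3t)$, $4\pi G\rho_b = 2/(3t^2)$) has power-law solutions $t^{2/3}$ and $t^{-1}$. The integrator is a plain fixed-step Runge-Kutta scheme written here for illustration; the function names and time units are arbitrary.

```python
def growth_rhs(t, delta, delta_dot):
    """First-order form of delta'' + (4/(3t)) delta' - (2/(3t^2)) delta = 0."""
    return delta_dot, -(4.0 / (3.0 * t)) * delta_dot + (2.0 / (3.0 * t * t)) * delta


def integrate_growth(t_start, t_end, delta0, delta_dot0, steps=20000):
    """Fixed-step fourth-order Runge-Kutta integration; returns delta(t_end)."""
    h = (t_end - t_start) / steps
    t, y, v = t_start, delta0, delta_dot0
    for _ in range(steps):
        k1y, k1v = growth_rhs(t, y, v)
        k2y, k2v = growth_rhs(t + h / 2, y + h / 2 * k1y, v + h / 2 * k1v)
        k3y, k3v = growth_rhs(t + h / 2, y + h / 2 * k2y, v + h / 2 * k2v)
        k4y, k4v = growth_rhs(t + h, y + h * k3y, v + h * k3v)
        y += h / 6 * (k1y + 2 * k2y + 2 * k3y + k4y)
        v += h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
        t += h
    return y


if __name__ == "__main__":
    # Start on the pure growing mode: delta = t^(2/3), so delta'(1) = 2/3.
    print(integrate_growth(1.0, 8.0, 1.0, 2.0 / 3.0))
    # Start at rest: a mixture 0.6 t^(2/3) + 0.4 t^(-1).
    print(integrate_growth(1.0, 100.0, 1.0, 0.0))
```

The crude estimate in the text gives an exponent $\sqrt{2/3} \approx 0.82$, somewhat steeper than the exact growing-mode exponent $2/3$, because it omits the expansion drag term.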

It is customary to introduce the Jeans mass, $M_J$, which gives a lower bound on
the objects which form. For a comoving $\lambda_J$, this is defined as

(2.66)

where $\rho_m$ is the background matter density. Since the critical wavelength scales
with $a(t)$ and $\rho_m$ varies like $a(t)^{-3}$, $M_J$ is independent of these variations and is a
particularly convenient quantity.

To illustrate the cosmological Jeans mass, we note that the analogs to equations
(2.29) and (2.30) in the full analysis give a similar equation for the evolution of a
mode $\delta_{\mathbf k}$ ($\mathbf k$ is the comoving wavevector),

$$\ddot\delta_{\mathbf k} + 2\frac{\dot a}{a}\dot\delta_{\mathbf k} + \left(\frac{v_s^2k^2}{a^2} - 4\pi G\rho_b\right)\delta_{\mathbf k} = 0,$$ (2.67)

from which we draw similar conclusions concerning the comoving Jeans length $\lambda_J$,


$$\lambda_J = \frac{2\pi}{k_J} = \frac{v_s}{a}\left(\frac{\pi}{G\rho_b}\right)^{1/2}.$$ (2.68)


The  Jeans  mass  in  a cosmological  setting  depends  on  the  background  energy  density 
(which  varies)  and  the  nature  of  the  matter  (which  also  varies).  Determining  Mj 
in a detailed fashion is a somewhat tractable problem in cosmology (Peebles 1980,
and  references  therein),  but  even  so  it  is  unclear  how  the  instability  further  develops 
and  how  the  various  preferred  length  scales  which  do  occur  are  ultimately  related  to 
galaxy  or  cluster  formation. 

To indicate rough orders of magnitude we show in Figure 2-2 the evolution of
the baryon Jeans mass (in units of $M_\odot$) with time (in terms of $a/a_0 = (1+z)^{-1}$),
covering the radiation and matter dominated eras. Details are presented in Weinberg
(1972), Peebles (1980), and Kolb and Turner (1990). We now address the qualitative
trends of $M_J$ shown in Figure 2-2.

Figure 2-2. Schematic diagram showing the baryon Jeans mass $M_J$ as a function of
the expansion factor $a(t)/a(t_0)$. Also shown is the Silk mass $M_S$ corresponding to
the photon diffusion damping scale which is in effect before decoupling.

Before matter-radiation equality, the Jeans mass increased as $a^3$, which we deduce
from equation (2.66), since $\rho_m \sim a^{-3}$, $v_s^2 = c^2/3$, and the background radiation density
is proportional to $a^{-4}$. Simply using the photon and neutrino background density
from equation (2.21) and a baryon density of (somewhat arbitrarily) $\Omega_B h^2 \approx 0.05$
leads to the expression plotted in Figure 2-2 for $z \gtrsim 10^4$. Note that in this radiation
dominated regime, $v_s$ is essentially $c$, and that the baryon Jeans mass is roughly the
same as the total baryon mass within the horizon. That means that the perturbations
that can grow are on superhorizon scales and behave as $\delta \sim t$ (Lifshitz 1946) according
to the full relativistic treatment (cf. Peebles 1980, §86). For $10^4 \gtrsim z \gtrsim 10^3$, the sound
speed dropped (but not abruptly). Since $v_s^2 \sim T \sim a^{-1}$, and we take $\rho \sim a^{-3}$, the
Jeans mass is roughly independent of $z$ here. After decoupling, the pressure support
comes from matter, not photons, and is small, so the Jeans mass drops precipitously. We
show the further decrease of $M_J$ after decoupling varying as $a^{-3/2}$, just assuming
that the nonrelativistic matter cools adiabatically (which implies $T \sim a^{-2}$), ignoring
any reionization, etc., somewhat inaccurately, since we know that the intergalactic
medium is not such a quiescent system.
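
The three scaling regimes of the baryon Jeans mass can be verified with exact rational arithmetic. This is only a bookkeeping sketch: it assumes $M_J \sim \rho_m \lambda^3$ with a physical Jeans length $\lambda \sim v_s/(G\rho_{\rm tot})^{1/2}$, and the function and argument names are illustrative rather than taken from the text.

```python
from fractions import Fraction


def jeans_mass_exponent(sound_speed_sq_exp, total_density_exp, matter_density_exp=-3):
    """Exponent n in M_J ~ a**n.

    Arguments are integer exponents of the expansion factor a:
    v_s^2 ~ a**sound_speed_sq_exp, rho_total ~ a**total_density_exp,
    rho_matter ~ a**matter_density_exp.
    """
    # lambda ~ v_s / sqrt(rho_total)
    length_exp = Fraction(sound_speed_sq_exp, 2) - Fraction(total_density_exp, 2)
    return Fraction(matter_density_exp) + 3 * length_exp


if __name__ == "__main__":
    print("before equality:  ", jeans_mass_exponent(0, -4))    # v_s ~ c, rho_tot ~ a^-4
    print("equality to dec.: ", jeans_mass_exponent(-1, -3))   # v_s^2 ~ a^-1
    print("after decoupling: ", jeans_mass_exponent(-2, -3))   # T ~ a^-2
```

The three cases correspond to the growth as $a^3$, the plateau, and the $a^{-3/2}$ decline described above.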

Prior to decoupling, there is a damping of baryonic fluctuations due to photon
diffusion. The sub-Jeans fluctuations propagate like sound waves with the photons
providing the pressure support. The diffusion of photons out of baryonic fluctuations
effectively damps fluctuations below the diffusion length scale (Silk 1968). The mass
scale $M_S$ (known as the Silk mass) so defined provides a lower bound on structures,
as fluctuations with $M < M_S$ are essentially washed out. The effect increases toward
decoupling as the diffusion increases, but the effect ceases after decoupling. Without
the damping, photons are trapped in a perturbation by Thomson scattering (the
$\gamma$-$e^-$ scattering makes for coupling with the protons too since protons and electrons
are coupled electromagnetically even in the fully ionized plasma).

We can crudely estimate the magnitude and trend of the damping scale by estimating
the diffusion length scale $d$ and then taking $M_S \sim \rho_m d^3$. The photon mean
free path is $\lambda_\gamma \sim (n_e\sigma)^{-1}$, where $\sigma$ is the Thomson cross section and $n_e$ is the electron
number density (which is just that of the baryons, $n_e \sim \Omega_B h^2 a^{-3}$). The diffusion
constant is $\sim \lambda_\gamma c$ and so the length scale is $d \sim (\lambda_\gamma c\,t)^{1/2}$. This gives $M_S \sim a^{9/2}$
during the radiation era and $M_S \sim a^{15/4}$ in the matter era prior to decoupling. Being
a little more careful with the numerical coefficients, using an ionization fraction 0.1
at decoupling (Kolb & Turner 1990), we have

$$M_S = 6 \times 10^{12}\,(\Omega_0/\Omega_B)^{3/2}\,(\Omega_0 h^2)^{-5/4}\,M_\odot$$ (2.69)

at decoupling. Note that $M_S < M_J$ in Figure 2-2.
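
The diffusion argument and equation (2.69) can be sketched as follows. The exponent bookkeeping assumes a mean free path growing as $a^3$, a diffusion length $d \sim (\lambda c t)^{1/2}$, and a mass scale proportional to $\rho_m d^3$; the function names and the sample parameter values are hypothetical and chosen only for illustration.

```python
from fractions import Fraction


def silk_mass_exponent(time_exp):
    """Exponent n in M_S ~ a**n when cosmic time scales as t ~ a**time_exp.

    Assumes mean free path ~ a**3, d ~ sqrt(mean_free_path * c * t),
    and M_S ~ rho_m * d**3 with rho_m ~ a**-3.
    """
    diffusion_length_exp = (Fraction(3) + Fraction(time_exp)) / 2
    return Fraction(-3) + 3 * diffusion_length_exp


def silk_mass_at_decoupling(omega0, omega_b, h):
    """Equation (2.69): Silk mass at decoupling, in solar masses."""
    return 6e12 * (omega0 / omega_b) ** 1.5 * (omega0 * h * h) ** -1.25


if __name__ == "__main__":
    print("radiation era (t ~ a^2):  ", silk_mass_exponent(2))
    print("matter era (t ~ a^(3/2)): ", silk_mass_exponent(Fraction(3, 2)))
    # Hypothetical parameters: Omega_0 = 1, h = 0.5, Omega_B h^2 = 0.05.
    print("M_S in solar masses:", silk_mass_at_decoupling(1.0, 0.2, 0.5))
```

The exponents reproduce the $a^{9/2}$ and $a^{15/4}$ trends quoted in the text.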

A universe with $\Omega_0$ contributed by baryons only is practically ruled out, due
primarily to problems relating to cosmic background radiation $\Delta T/T$ measurements
and to primordial nucleosynthesis constraints. Given the structure we see today,
the $\delta(\mathbf x)$ values at decoupling would have to have been fairly large, corresponding
to background temperature anisotropies on the order of $\Delta T/T \approx 10^{-2}$-$10^{-3}$. This
has been a clear problem for years, given the decreasing upper bound constraints
placed on $\Delta T/T$ by experimentalists, which finally led to a statistical detection of
$\Delta T/T \approx 10^{-5}$ fluctuations (Smoot et al. 1992). The nucleosynthesis problem stems
from the robust predictions of the light element abundances, which proved to be one
of the experimental foundations of big bang models. The small baryon number of the
universe is parametrised by the baryon to photon ratio $\eta = n_B/n_\gamma$. We can estimate
the current $\eta$ as follows. Since $n_B = \rho_c(t_0)\,\Omega_B/m_p$, and $n_\gamma = (\zeta(3)/\pi^2)\,g\,T^3$, with
$\zeta(3) \approx 1.2$ and $T = 2.73$ K, we have (Kolb & Turner 1990)

$$\eta \approx 2.7 \times 10^{-8}\,\Omega_B h^2.$$ (2.70)

The successes and constraints, e.g. $3 \times 10^{-10} \lesssim \eta \lesssim 10 \times 10^{-10}$, provided by the
nucleosynthesis calculations seem to indicate that the dark matter we detect in halos
and large structures must be (at least in part) nonbaryonic.
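
The coefficient in equation (2.70) can be reproduced with a short calculation. The constants are standard CGS values, the photon count uses $g = 2$ polarization states, and the function names are illustrative; none of these are taken from the text beyond the quoted temperature and critical density.

```python
import math

# Standard physical constants in CGS units (assumed here, not quoted in the text).
K_BOLTZMANN = 1.380649e-16      # erg K^-1
HBAR = 1.054572e-27             # erg s
SPEED_OF_LIGHT = 2.99792458e10  # cm s^-1
PROTON_MASS = 1.67262e-24       # g
RHO_CRIT_H2 = 1.88e-29          # g cm^-3, critical density for h = 1
ZETA_3 = 1.2020569              # Riemann zeta(3)


def photon_number_density(temperature_kelvin):
    """Blackbody photon number density (cm^-3) with g = 2."""
    inverse_length = K_BOLTZMANN * temperature_kelvin / (HBAR * SPEED_OF_LIGHT)
    return (2.0 * ZETA_3 / math.pi**2) * inverse_length**3


def eta_per_omega_b_h2(temperature_kelvin=2.73):
    """Baryon-to-photon ratio divided by Omega_B h^2."""
    baryon_density_per_omega = RHO_CRIT_H2 / PROTON_MASS
    return baryon_density_per_omega / photon_number_density(temperature_kelvin)


def omega_b_h2_from_eta(eta, temperature_kelvin=2.73):
    """Invert the relation to get Omega_B h^2 for a given eta."""
    return eta / eta_per_omega_b_h2(temperature_kelvin)


if __name__ == "__main__":
    print("n_gamma (cm^-3):", photon_number_density(2.73))
    print("eta / (Omega_B h^2):", eta_per_omega_b_h2())
    print("Omega_B h^2 range:", omega_b_h2_from_eta(3e-10), omega_b_h2_from_eta(10e-10))
```

Inverting the quoted nucleosynthesis bounds on $\eta$ gives a baryon density of a few percent of critical (times $h^{-2}$), which is the basis for the conclusion that some dark matter must be nonbaryonic.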

In  the  case  of  nonbaryonic  dark  matter,  structure  formation  is  qualitatively  similar 
to  the  baryon  scenario  briefly  described  above,  but  with  significant  differences  in  that 
the  dark  matter  models  are  able  to  overcome  the  problems  of  the  baryon-only  models. 
However,  there  are  many  different  dark  matter  scenarios,  and  predictions  tend  to  be 
very  model-dependent,  and  no  particular  model  can  claim  champion  status. 

We note some general features of a theory in which nonbaryonic species with
nonzero mass decouple early yet remain cosmologically significant. Details, including
some of the particle physics background, are covered by Kolb and Turner (1990).
The collisionless Jeans mass, from the Jeans length of equation (2.63), still pertains,
with the velocity dispersion $\langle v^2\rangle$ being the analog to the sound speed due to pressure
(though pressure is not the right term in the collisionless case of WIMPs, weakly
interacting massive particles). Damping due to free streaming is effective on scales
smaller than the Jeans length (with $M_{FS} < M_J$ in general, in analogy to the case
of photon diffusion damping), so strictly speaking one would speak of $M_{FS}$ rather
than $M_J$. A full analysis (Bond & Szalay 1983) shows that there are two chief scales
which occur, $\lambda_{FS}$ and $d_{\rm hor}$ at the epoch of matter-radiation equality, though these
length scales are not generally sharply defined in the "initial" spectrum of density
fluctuations $\delta(k)$ at high redshift ($z \approx 1000$). Decoupling does not influence matters
at all, in contrast to the baryon case. Gravitational instability is free to operate in
the nonbaryonic component before decoupling, which overcomes the timing or $\Delta T/T$
problems of the baryon-only model. If the dark matter fluctuations have grown by
the time decoupling occurs, then the baryon fluctuations (already damped on small
scales) begin growing, and in time they mimic the nonbaryonic fluctuations if the
WIMPs provide the dominant source of gravitation.

These  models  vary  widely,  and  they  generally  are  susceptible  to  some  sort  of 
experimental  verification.  For  example,  the  simplest  Hot  Dark  Matter  (HDM)  and 
Cold  Dark  Matter  (CDM)  seem  to  be  ruled  out.  If  we  ignore  the  hydrodynamics  of 
collapse  and  galaxy  formation  (which  cannot  be  ignored  in  general),  then  the  various 
models' properties show up in the spectrum of small initial density fluctuations, when
$z > 1000$.

2.2.3 Equations of Motion

We study perturbations to the homogeneous expanding spacetime with line element

$$ ds^2 = dt^2 - a^2(t)\,d\mathbf{x}^2 . \tag{2.71} $$
The full machinery of general relativity is not needed in our study, as we confine
ourselves to the slow motion, subhorizon limits. The Newtonian, weak field limit has
a line element which we may write (e.g. for a spatially flat background) as

$$ ds^2 = (1 + 2\phi)\,dt^2 - (1 - 2\phi)\,a^2(t)\,d\mathbf{x}^2 , \tag{2.72} $$

where $\phi$ is the Newtonian gravitational potential coupled to the density fluctuations
via the Poisson equation, as we describe below. The constraint $|\phi| \ll 1$ is essential to
the validity of the limit, but this does not at all preclude large values of $\delta(\mathbf{x}, t)$, the
density contrast field. The line element (2.71) about which we perturb can be open,
closed, or flat ($\Omega = 1$), though we certainly use approximations based on $\Omega \approx 1$, for
example when we use the $e^{i\mathbf{k}\cdot\mathbf{x}}$ mode expansion.

The equations of motion in the fluid model will be the direct analogs of the
usual continuity, Euler, Poisson, and pressure $p(\rho)$ equations, modified to apply in an
expanding  universe  in  comoving  coordinates  x.  Before  we  write  out  those  equations, 
let  us  separate  out  from  the  equations  for  a mass  m in  an  expanding  background  the 
parts  pertaining  to  the  expansion,  thus  leaving  the  perturbed  motion. 

The proper coordinates $\mathbf{r}$ and comoving coordinates $\mathbf{x}$ are related by

$$ \mathbf{r} = a(t)\,\mathbf{x} \tag{2.73} $$
$$ \dot{\mathbf{r}} = \dot{a}\,\mathbf{x} + \mathbf{v}(\mathbf{x},t) \tag{2.74} $$

where the peculiar velocity $\mathbf{v} = a\dot{\mathbf{x}}$ has been separated from the Hubble flow. The
time-time component of the Einstein equations and the geodesic equation, in the
Newtonian limit, subsuming the expansion and using $\mathbf{r}$ coordinates, are the familiar

$$ \nabla_r^2 \Phi = 4\pi G\rho \tag{2.75} $$
$$ \ddot{\mathbf{r}} = -\nabla_r \Phi \tag{2.76} $$

where the density is $\rho = \bar\rho(1+\delta)$. We neglect pressure and cosmological constant, which
could in principle be included; such effects are included in Peebles (1980). Separating
out the part of $\Phi$ due to the expansion (using equations (2.75) and (2.5), which follows
here from (2.76) also), we write

$$ \Phi = \phi(\mathbf{x},t) + \tfrac{2}{3}\pi G\bar\rho\,a^2 x^2 = \phi(\mathbf{x},t) - \tfrac{1}{2}\,a\ddot{a}\,x^2 . \tag{2.77} $$


Thus we have the Poisson equation written in comoving coordinates,

$$ \nabla_x^2\phi(\mathbf{x},t) = 4\pi G\bar\rho\,a^2\,\delta(\mathbf{x},t) , \tag{2.78} $$

which couples the density fluctuations to the perturbations in the expanding
background metric. For brevity, we denote $\nabla_x$ by $\nabla$.

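All this is more succinctly noted (cf. Peebles 1980) if we start with the action $S$,

$$ S = -m\int d\tau = -m\int\sqrt{g_{\mu\nu}\,dx^\mu dx^\nu} = \int dt\left[\tfrac{1}{2}m(\dot{a}\mathbf{x} + a\dot{\mathbf{x}})^2 - m\Phi\right] + \text{const.} \tag{2.79} $$

Then we let the usual $L = T - V$ as we have above go to $L - d\psi/dt$ with a canonical
transformation, $\psi = \tfrac{1}{2}m a\dot{a}x^2$. This transformation makes good the separation between
the Hubble flow and peculiar motions, giving

$$ L = \tfrac{1}{2}m a^2\dot{\mathbf{x}}^2 - m\phi . \tag{2.80} $$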

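As noted above, $\phi$ and $\delta$ are coupled by the Poisson equation (2.78). The Euler-
Lagrange equations from (2.80) give

$$ \frac{d\mathbf{p}}{dt} = \frac{d}{dt}\left(m a^2\dot{\mathbf{x}}\right) = -m\nabla\phi \tag{2.81} $$
$$ \frac{d\mathbf{v}}{dt} + \frac{\dot a}{a}\mathbf{v} = -\frac{1}{a}\nabla\phi . \tag{2.82} $$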

We now write the equations of motion of self gravitating matter in an expanding
universe. Since we are able to separate out the background Hubble flow, utilizing
$\bar\rho(t) \sim a^{-3}$ and the Friedmann equation in the physical $\mathbf{r}$ coordinates, the equations in
our fluid model will be written in terms of $\delta(\mathbf{x},t)$ and $\mathbf{v}(\mathbf{x},t)$ in comoving coordinates.


There are two approaches (Peebles 1980, §9) which lead to the same fluid equations,
the ideal fluid and collisionless matter approaches.

If we consider the particle mean free path to be small and that the universe is
filled with a self gravitating, Newtonian ideal fluid, then we may begin with the usual
continuity and Euler equations in $\mathbf{r}$ coordinates. Then if we change coordinates from
$\mathbf{r}$ to $\mathbf{x} = \mathbf{r}/a$ and follow through with the complete change of variable procedure for
the density $\rho(\mathbf{r}/a, t)$ and velocity $\dot{\mathbf{r}} = H\mathbf{r} + \mathbf{v}(\mathbf{r}/a, t)$, the fluid equations become

$$ \frac{\partial\rho}{\partial t} + 3\frac{\dot a}{a}\rho + \frac{1}{a}\nabla\cdot(\rho\mathbf{v}) = 0 \tag{2.83} $$
$$ \frac{\partial\mathbf{v}}{\partial t} + \frac{1}{a}\mathbf{v}\cdot\nabla\mathbf{v} + \frac{\dot a}{a}\mathbf{v} = -\frac{1}{\rho a}\nabla p - \frac{1}{a}\nabla\phi . \tag{2.84} $$

Note that we can rewrite (2.83) using $\delta(\mathbf{x},t)$ as $a\dot\delta = -\nabla\cdot[(1+\delta)\mathbf{v}]$.

The collisionless approach is more natural for the development of large structures.
We have already separated out the expansion in the single particle dynamical equations
above. The smooth gravitational potential $\phi(\mathbf{x},t)$ determines $\delta(\mathbf{x},t)$, describing
noninteracting (but for gravity) fluid elements of mass $m$ (which could be dark matter
or galaxies, for example). The one particle distribution function is $f(\mathbf{x},\mathbf{p},t)$, where
the canonical momentum is $\mathbf{p} = m a^2\dot{\mathbf{x}}$ as in equation (2.81). The background density
is $m\bar n$, so the comoving number density $\bar n a^3$ is constant. The first three moments of
$f(\mathbf{x},\mathbf{p},t)$ are defined as

(2.85)

(2.86)


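The dynamics of $f$ is specified by the Vlasov equation,

$$ \frac{\partial f}{\partial t} + \frac{\mathbf{p}}{m a^2}\cdot\nabla f - m\nabla\phi\cdot\frac{\partial f}{\partial\mathbf{p}} = 0 . \tag{2.87} $$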


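Taking moments of the Vlasov equation gives the continuity and Euler equations
for collisionless fluid in an expanding background:

$$ \frac{\partial\delta}{\partial t} + \frac{1}{a}\nabla\cdot[(1+\delta)\mathbf{v}] = 0 \tag{2.89} $$
$$ \frac{\partial\mathbf{v}}{\partial t} + \frac{1}{a}\mathbf{v}\cdot\nabla\mathbf{v} + \frac{\dot a}{a}\mathbf{v} = -\frac{1}{a}\nabla\phi . \tag{2.90} $$

These equations, combined with

$$ \nabla^2\phi(\mathbf{x},t) = 4\pi G\bar\rho\,a^2\,\delta(\mathbf{x},t) , \tag{2.91} $$

are the desired equations of motion. We can combine (2.89) and (2.90) to obtain an
important equation governing density fluctuations (Peebles 1980):

$$ \frac{\partial^2\delta}{\partial t^2} + 2\frac{\dot a}{a}\frac{\partial\delta}{\partial t} - 4\pi G\bar\rho\,\delta = 4\pi G\bar\rho\,\delta^2 + \frac{1}{a^2}\nabla\delta\cdot\nabla\phi + \frac{1}{a^2}\nabla_i\nabla_j\left[(1+\delta)v^i v^j\right] . \tag{2.92} $$

Our approach will be to use perturbation theory based on these equations.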
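
The step from (2.89) and (2.90) to (2.92) can be sketched as follows. This is an outline in index notation (repeated indices summed, dots denoting time derivatives at fixed $\mathbf{x}$), written here as an aid to the reader rather than as the derivation given by Peebles (1980).

```latex
% Sketch: combining the continuity equation (2.89) and Euler equation (2.90).
\begin{align*}
a\,\dot\delta &= -\partial_i\!\left[(1+\delta)v^i\right]
  && \text{(2.89)} \\
\dot a\,\dot\delta + a\,\ddot\delta
  &= -\partial_i\!\left[\dot\delta\,v^i + (1+\delta)\,\dot v^i\right]
  && \text{(time derivative of the line above)} \\
-\partial_i\!\left[\dot\delta\,v^i\right]
  &= \frac{1}{a}\,\partial_i\!\left\{v^i\,\partial_j\!\left[(1+\delta)v^j\right]\right\}
  && \text{(2.89) used for } \dot\delta \\
-\partial_i\!\left[(1+\delta)\,\dot v^i\right]
  &= \frac{1}{a}\,\partial_i\!\left[(1+\delta)\,v^j\partial_j v^i\right]
   + \frac{\dot a}{a}\,\partial_i\!\left[(1+\delta)v^i\right]
   + \frac{1}{a}\,\partial_i\!\left[(1+\delta)\,\partial_i\phi\right]
  && \text{(2.90) used for } \dot v^i \\
\ddot\delta + 2\frac{\dot a}{a}\,\dot\delta
  &= \frac{1}{a^2}\,\partial_i\partial_j\!\left[(1+\delta)v^i v^j\right]
   + \frac{1}{a^2}\,\partial_i\!\left[(1+\delta)\,\partial_i\phi\right]
  && \text{(middle term equals } -\dot a\,\dot\delta\text{)} \\
\frac{1}{a^2}\,\partial_i\!\left[(1+\delta)\,\partial_i\phi\right]
  &= 4\pi G\bar\rho\,\delta(1+\delta) + \frac{1}{a^2}\,\nabla\delta\cdot\nabla\phi
  && \text{(2.91)}
\end{align*}
```

Moving the term $4\pi G\bar\rho\,\delta$ to the left hand side of the fifth line, after substituting the sixth, reproduces (2.92).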

2.2.4 Linear Perturbation Theory

We begin here by laying out the simplest, yet important, case of linear perturbation
theory. Early works by Doroshkevich and Zel'dovich (1963) and Zel'dovich
(1965) clarified the nature of perturbation theory in the Newtonian approximation.
Papers which went beyond first order theory were supplied by Zel'dovich (1970),
Peebles (1980), Fry (1984), and Goroff, Grinstein, Rey, and Wise (1986).

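If we keep only the lowest order terms in $\delta$ and $\mathbf{v}$ in the fluid equations, we have
as our system the linearized continuity and Euler equations

$$ \frac{\partial\delta}{\partial t} + \frac{1}{a}\nabla\cdot\mathbf{v} = 0 \tag{2.93} $$
$$ \frac{\partial\mathbf{v}}{\partial t} + \frac{\dot a}{a}\mathbf{v} = -\frac{1}{a}\nabla\phi \tag{2.94} $$

along with the Poisson equation. For the evolution of $\delta$, we need only keep the left
hand side of (2.92), which gives us

$$ \frac{\partial^2\delta}{\partial t^2} + 2\frac{\dot a}{a}\frac{\partial\delta}{\partial t} - 4\pi G\bar\rho\,\delta = 0 . \tag{2.95} $$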

The unperturbed configuration, $\mathbf{v} = 0$, $\delta = 0$, $\phi = 0$, is now an actual solution to
the equations of motion, in contrast to the case of a static background for which we
performed the Jeans analysis in §2.2.2. Note that if we include pressure, as we wrote
in the ideal fluid equation (2.84), and expand in $e^{i\mathbf{k}\cdot\mathbf{x}}$ modes with physical
wavelength $2\pi a/k$, we arrive at the expanding Jeans analysis equation (2.67) advertised
previously.

Now we write the leading order terms in the perturbation theory expansion. First,
note that the $\delta$ evolution equation (2.95) is completely local, and the solution
separates immediately to

$$ \delta(\mathbf{x},t) = A(t)\,\delta_0(\mathbf{x}) . \tag{2.96} $$

So any initial fluctuations, $\delta_0(\mathbf{x}) = \delta(\mathbf{x}, t_i)$ at some early time $t_i$, in comoving
coordinates $\mathbf{x}$, retain their shape and grow in amplitude. The $A(t)$ solutions to

$$ \ddot A + 2\frac{\dot a}{a}\dot A - 4\pi G\bar\rho\,A = 0 \tag{2.97} $$

are covered thoroughly by Peebles (1980). For the $\Omega_0 = 1$ Einstein-de Sitter case,
or early when it is effectively valid, $1 + z \gg |\Omega_0^{-1} - 1|$ and the curvature term in
the Friedmann equation (2.4) is negligible, we have $a \sim t^{2/3}$ and $6\pi G\bar\rho\,t^2 = 1$, so that
equation (2.97) reduces to

$$ \ddot A + \frac{4}{3t}\dot A - \frac{2}{3t^2}A = 0 , \tag{2.98} $$

which has two power law solutions,

$$ A(t) \sim t^{2/3} , \qquad A(t) \sim t^{-1} . \tag{2.99} $$

So $\delta(\mathbf{x}, t)$ is the sum of two independent solutions, one growing and the other decaying,
with the particular linear combination depending on initial conditions. We only
focus on the $t^{2/3}$ solution, which soon dominates the $t^{-1}$ term and leads to structure
formation.
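
The two power laws and the dominance of the growing mode can be checked numerically. The sketch below assumes the substitution $s = \ln t$, which turns the Einstein-de Sitter growth equation into the constant-coefficient form $A'' + A'/3 - 2A/3 = 0$; the function names and the initial condition (equal-weight mix set by $A = 1$, $A' = 0$) are illustrative choices, not taken from the text.

```python
import numpy as np


def power_law_exponents():
    """Exponents n for trial solutions A = t^n: n(n-1) + (4/3)n - 2/3 = 0."""
    return np.sort(np.roots([1.0, 1.0 / 3.0, -2.0 / 3.0]).real)


def late_time_slope(s_end=12.0, steps=6000):
    """Integrate A'' + A'/3 - 2A/3 = 0 in s = ln t with RK4; return dlnA/dlnt."""
    def rhs(y):
        amp, damp = y
        return np.array([damp, -damp / 3.0 + 2.0 * amp / 3.0])

    y = np.array([1.0, 0.0])   # mixture of growing and decaying modes
    h = s_end / steps
    for _ in range(steps):
        k1 = rhs(y)
        k2 = rhs(y + 0.5 * h * k1)
        k3 = rhs(y + 0.5 * h * k2)
        k4 = rhs(y + h * k3)
        y = y + h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return y[1] / y[0]


exponents = power_law_exponents()
slope = late_time_slope()
print("power law exponents:", exponents)
print("late-time logarithmic slope:", slope)
```

Starting from a mixture of the two modes, the logarithmic slope relaxes to the growing-mode exponent because the decaying mode is suppressed by a relative factor $t^{-5/3}$.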

As we did in the Jeans analysis, we can split the peculiar velocity into curl-free
(longitudinal) and divergence-free (transverse) pieces. Only the longitudinal part
is coupled to gravity in the Euler equation. Since a potential force term cannot
generate vorticity, we have, in analogy to Kelvin's conservation of circulation in a
static background, that the transverse piece $\mathbf{v}_\perp$ and the vorticity $\nabla\times\mathbf{v}_\perp$ decay. This can
be seen from the curl of the linearized Euler equation, as only the transverse piece
survives,

$$ \frac{\partial}{\partial t}\left(a\nabla\times\mathbf{v}_\perp\right) = 0 , \tag{2.100} $$

which implies $\mathbf{v}_\perp = \nabla\times\mathbf{F}(\mathbf{x})/a$ for some $\mathbf{F}$. We arrived at the conclusion from linear
theory, but it is true in general, as (2.100) is reproducing the Kelvin result that
the circulation

$$ \oint a\,\mathbf{v}\cdot d\mathbf{x} \tag{2.101} $$

is constant around a material loop. No vorticity can be generated in the large.
Some dissipative small scale processes will do so. Similar conclusions hold in the
nonlinear regime until orbits begin mixing (Peebles 1980, Zel'dovich and Novikov 1983).
Consequently, we concern ourselves with potential flow for $\mathbf{v}$.

The linear $\mathbf{v}$ solution follows simply once we know that $\delta = A(t)\,\delta_0(\mathbf{x})$ and we
can write the velocity as a gradient.

(2.102)


From the continuity equation $a\dot\delta = -\nabla\cdot\mathbf{v}$ and $\dot\delta/\delta = \dot A/A$, we have $\nabla^2\phi_v \sim \delta$
so $\phi_v \sim \phi$ up to time dependent factors. Getting the factors correct from the Poisson
equation (2.91), we have

$$ \mathbf{v} = -\frac{1}{4\pi G\bar\rho\,a}\,\frac{\dot A}{A}\,\nabla\phi . \tag{2.103} $$

So we see that in linear theory, $\mathbf{v}$ is parallel to the peculiar acceleration

$$ \mathbf{g} = -\frac{1}{a}\nabla\phi . \tag{2.104} $$

As for time dependence, with the first order $\delta = A(t)\,\delta_0$ solution as the source
in the Poisson equation, we see that the linear gravitational potential $\phi$ has time
dependence

$$ \phi \sim \frac{A}{a} , \tag{2.105} $$

so it is nearly constant in time. For $\Omega = 1$, the linear potential is, in fact, time
independent. The peculiar velocity solution has $\mathbf{v} \sim a\dot A$, and for $\Omega = 1$, (2.103) is
simply $\mathbf{v} = \mathbf{g}\,t$ (so $v \sim t^{1/3}$ in that case).

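The expressions involving $A$ may be rewritten (Peebles 1980) by defining

$$ f = \frac{d\ln A}{d\ln a} = \frac{a}{\dot a}\,\frac{\dot A}{A} , \tag{2.106} $$

which characterizes the $A(t)$ solutions conveniently. For $\Omega = 1$, we have $f = 1$. For
$\Omega$ asymptotically near 1, $f \approx \Omega^{4/7}$, and $f \approx \Omega^{0.6}$ is a good numerical approximation
for smaller $\Omega$. The expressions for the linear $\mathbf{v}$ and $\delta$ can thus be written, with $H = \dot a/a$, as

$$ \mathbf{v} = \frac{H f}{4\pi G\bar\rho}\,\mathbf{g} \tag{2.107} $$
$$ \delta = -\frac{1}{a H f}\,\nabla\cdot\mathbf{v} . \tag{2.108} $$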
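
The quality of the $f \approx \Omega^{0.6}$ approximation can be checked numerically. The sketch below assumes a matter plus curvature model with no cosmological constant and uses the standard integral form of the growing mode, $A(a) \propto E(a)\int_0^a da'/[a'E(a')]^3$ with $E^2 = \Omega_0 a^{-3} + (1-\Omega_0)a^{-2}$; this integral form and the function names are not taken from the text.

```python
import numpy as np


def growth_rate(omega0, a=1.0, samples=20001):
    """f = dlnA/dlna for matter + curvature, from A ~ E(a) * int da/(aE)^3."""
    grid = np.linspace(0.0, a, samples)
    # 1/(aE)^3 rewritten as a^(3/2) * (omega0 + (1 - omega0) a)^(-3/2).
    integrand = grid**1.5 * (omega0 + (1.0 - omega0) * grid)**-1.5
    integral = np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(grid))
    e2 = omega0 * a**-3 + (1.0 - omega0) * a**-2
    dlne_dlna = -(3.0 * omega0 * a**-3 + 2.0 * (1.0 - omega0) * a**-2) / (2.0 * e2)
    return dlne_dlna + a * integrand[-1] / integral


def omega_at(omega0, a=1.0):
    """Density parameter at scale factor a for the same model."""
    matter = omega0 * a**-3
    return matter / (matter + (1.0 - omega0) * a**-2)


for omega0 in (1.0, 0.5, 0.3, 0.1):
    print(omega0, growth_rate(omega0), omega_at(omega0)**0.6)
```

The printed pairs compare the numerically integrated $f$ with $\Omega^{0.6}$ at the present epoch for a few values of $\Omega_0$.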


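The general $\phi$ solution to the Poisson equation may be written

$$ \phi(\mathbf{x},t) = -G\bar\rho\,a^2\int d^3x'\,\frac{\delta(\mathbf{x}',t)}{|\mathbf{x}' - \mathbf{x}|} = 4\pi G\bar\rho\,a^2\,\Delta(\mathbf{x},t) , \tag{2.109} $$

where $\Delta$ is a rescaled gravitational potential,

$$ \nabla^2\Delta = \delta \tag{2.110} $$
$$ \phi = 4\pi G\bar\rho\,a^2\,\Delta . \tag{2.111} $$

The Poisson solution is used self consistently, order by order, in perturbation theory.
In linear theory, the rescaled gravitational potential, with time dependence $\Delta \sim A(t)$,
is related to $\delta$ by (2.109) with the first order $\delta = A(t)\,\delta_0$ solution as the source.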

The linear theory presented here is strictly valid only for $\langle\delta^2\rangle \ll 1$, and on
large (yet subhorizon) scales, the simplifications made here (using the Newtonian limit,
neglecting non-gravitational physics) are appropriate. It would seem that current
large-scale fluctuations would retain some of the information from the linearly evolved
quantities. At the least, we have, for example, that an initial ($t = t_i$) perturbation
spectrum will evolve in time like $A^2|\delta(\mathbf{k}, t_i)|^2$ for small $k$ (large scales). In fact, going
beyond simple first order theory seems to be more fruitful than might at first be
expected (Zel'dovich 1970, Fry 1984). Large-scale statistics in the galaxy distribution
seem to be related to the dynamics provided by perturbation theory. The relationship
is  difficult  to  untangle  in  practice,  as  real  observational  data  rarely  have  a clean 


til. 


Correlations 


In an attempt to characterize the large-scale matter distribution, we utilize
cosmological statistics which are in principle measurable. The matter density $\rho$ is directly
related, we assume, to the galaxy number density $n_g$, and we use the dimensionless
density contrast fields:

$$ \delta(\mathbf{x}) = \frac{\rho(\mathbf{x}) - \bar\rho}{\bar\rho} \tag{2.112} $$
$$ \delta_g(\mathbf{x}) = \frac{n_g(\mathbf{x}) - \bar n_g}{\bar n_g} . \tag{2.113} $$

The peculiar velocity field $\mathbf{v}(\mathbf{x})$ is measurable in principle if it is a gradient. Since $\mathbf{v}$
is fundamentally related to $\delta(\mathbf{x})$ and not the potentially biased $\delta_g$, statistics involving
velocities may have great value in the future, but current data are only preliminary.
We focus on the density. From the equations of motion, the $\phi$, $\mathbf{v}$, and $\delta$ fields are all
amenable to study.

Confining our observational interest to regions $< 1000\,h^{-1}$ Mpc, we are effectively
ignoring lookback times. We somewhat cavalierly treat galaxies as simple objects
which trace the mass, with the possibility of a simple local biasing relation $\delta_g = f(\delta)$.
The statistics of galaxy properties and their evolution is an entire, related field of
study and is reviewed by Kron (1995). The observable density field naturally uses
physical coordinates, so we could write the measured density as $\delta(\mathbf{x}_{phys})$. We may
take $\mathbf{x}_{phys}$ as coinciding with the comoving coordinates at the present time if $\mathbf{x}_{phys} =
a\mathbf{x}$ and $a(t_0) = 1$ presently.

One of the central problems surrounding the data stems from the immense
difficulties in amassing deep galaxy surveys. The photometric selection criteria and the
luminosity distributions for the survey galaxies can in principle be accounted for by
a selection function which gives various statistical weights to the galaxies included in
statistical estimators involving $\delta_g$. We do not include such weights in formulas below
(they are included implicitly when any statistic is to be estimated from real data).

There is a fairly rich history of attempts to use statistical measures on the galaxy
data, and this is covered by Peebles (1980). As the quality and quantity of data
improved in the 1960's and 1970's, correlation statistical methods were pioneered by
Peebles and his collaborators. Counts-in-cell statistics, the void probability function
in particular, have proved useful. Statistics related to shape, topology, and percolation
should prove useful, though no standard has really taken hold.

Note that the initial conditions on $\delta(\mathbf{x}, t_i)$ are random in nature, usually given
by  the  spectrum  of  perturbations  in  Fourier  space.  Early  universe  theories  such  as 
inflation  can  specify  the  statistics  of  the  primordial  fluctuations  (and  this  is  one  of 
their  main  selling  points  as  theories).  Predictions  concerning  structure  formation  are 
statistical.  We  do  not  wish  to  predict  the  location  of  the  Coma  cluster  in  the  Great 
Wall,  or  predict  the  exact  origin  of  the  local  supercluster,  from  initial  conditions. 
Some  conclusions  are  intuitively  clear.  For  example,  if  a model  is  not  able  to  produce 
“enough” large voids and filaments in its modeled galaxy distributions, then the
theory  is  lacking.  The  statistics  we  study  are  similar  to  correlated  noise,  such  as 
what  we  might  study  in  investigating  the  choppy  surface  of  water  in  a swimming 
pool  or  the  random  peaks  and  troughs  in  the  ocean's  water  surface,  assuming  the 
coherence  length  of  the  noise  is  sampled  many  times  over  in  our  survey  of  the  random 
water  surface. 

We consider $\delta(\mathbf{x})$ to be a homogeneous and isotropic random field. This is based
on the Cosmological Principle, along with an ergodic hypothesis, whereby a large
enough sample in our local universe (in the NGP direction, say) is an equivalent
realization of the same random process as compared to a survey in the SGP direction.
This is formalized by Peebles (1980) and is called the Fair Sample Hypothesis. The
Slice of the Universe survey, Figure 1-9, is not quite a fair sample. The new LCRS
slices, Figure 1-10, seem to show the requisite large-scale homogeneity.

If we take a large volume $V$ which we assume to be a fair sample, then we define
the important 2-point correlation function as

$$ \xi(r) = \langle\delta(\mathbf{x} + \mathbf{r})\,\delta(\mathbf{x})\rangle , \tag{2.114} $$

where the $\langle\ldots\rangle$ indicates an average over $V$ (which we take as equivalent to an
ensemble average). By homogeneity and isotropy, the dependence is on $r = |\mathbf{r}|$.

As we have introduced it, $\xi(r)$ is the autocorrelation function of a continuous
density field $\delta(\mathbf{x})$. If we take the continuous field as our starting point (as the
fundamental realization of the random process), then we may invoke a Poisson model to
get a discrete realization of $\delta(\mathbf{x})$, which gives

$$ dP = \bar n^2\left[1 + \xi(r)\right]dV_1\,dV_2 \tag{2.115} $$

as the joint probability that two objects are found in volume elements $dV_1$, $dV_2$
separated by $r$. Or we may start with a fundamentally discrete model which defines
$\xi(r)$ by (2.115). The continuous and discrete approaches are related throughout the
correlation hierarchy, as described by Peebles (1980), Bertschinger (1992), and Fry
(1997).

Equation (2.115) has a nice conditional probability interpretation: if we choose
an object at random from the realization, the probability that a neighbor resides at
distance $r$ from it in volume $dV$ is

$$ dP = \bar n\,dV\left[1 + \xi(r)\right] . \tag{2.116} $$

This gives a way to directly estimate $\xi$ from galaxy counts, by comparing the actual
counts to those of a random catalog. A robust result for $r \lesssim 10\,h^{-1}$ Mpc is that

$$ \xi(r) = \left(\frac{r_0}{r}\right)^{\gamma} , \tag{2.117} $$

with $r_0 \approx 5\,h^{-1}$ Mpc and $\gamma \approx 1.8$, while for larger scales $\xi$ is not well measured.
The APM angular 2-point function of $\approx 2 \times 10^6$ galaxies gave a deprojected $\xi(r)$ in
agreement with (2.117), as did the earlier Lick Observatory “million galaxies” counts.
Though the APM $\xi$ decreased at scales $\gtrsim 10\,h^{-1}$ Mpc, the amplitude was large in
comparison with basic CDM predictions, for example.
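
The comparison of actual counts to those of a random catalog can be sketched as below. This is a minimal illustration, assuming the simple ratio estimator $\xi = DD/RR - 1$ with normalized pair counts, brute-force pair counting, and no selection function or edge corrections; the function names and the toy catalogs are hypothetical and are not the estimator used for any survey cited in the text.

```python
import numpy as np


def pair_counts(points, bins):
    """Histogram of separations over all distinct pairs in a point set."""
    diff = points[:, None, :] - points[None, :, :]
    dist = np.sqrt((diff**2).sum(axis=-1))
    upper = np.triu_indices(len(points), k=1)   # each pair counted once
    counts, _ = np.histogram(dist[upper], bins=bins)
    return counts.astype(float)


def xi_natural(data, randoms, bins):
    """Estimate xi(r) as (normalized DD) / (normalized RR) - 1 in each bin."""
    dd = pair_counts(data, bins) / (len(data) * (len(data) - 1) / 2.0)
    rr = pair_counts(randoms, bins) / (len(randoms) * (len(randoms) - 1) / 2.0)
    with np.errstate(divide="ignore", invalid="ignore"):
        return np.where(rr > 0, dd / rr - 1.0, np.nan)


rng = np.random.default_rng(1)
bins = np.linspace(0.0, 0.5, 11)
randoms = rng.random((400, 3))                       # unclustered comparison catalog
parents = rng.random((200, 3))
companions = parents + rng.normal(0.0, 0.01, (200, 3))
clustered = np.vstack([parents, companions])         # toy catalog of close pairs

xi_clustered = xi_natural(clustered, randoms, bins)
xi_random = xi_natural(randoms, randoms, bins)
print("clustered:", np.round(xi_clustered, 2))
```

The toy clustered catalog shows a large positive $\xi$ in the smallest separation bin, while a catalog compared with itself returns zero in every bin by construction.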


The $N$-point correlation function is given by

$$ dP = \bar n^N\,dV_1\cdots dV_N\left[1 + (N\text{-point function})\right] , \tag{2.118} $$

with a probabilistic interpretation as in equation (2.115). In particular, the 3-point
function is given by

$$ dP = \bar n^3\,dV_1\,dV_2\,dV_3\left[1 + \xi_{12} + \xi_{23} + \xi_{31} + \zeta_{123}\right] , \tag{2.119} $$

where $\xi_{12} = \xi(|\mathbf{x}_1 - \mathbf{x}_2|)$, and $\zeta_{123}$ depends on three numbers, the sides of the $\mathbf{x}_1$, $\mathbf{x}_2$,
$\mathbf{x}_3$ triangle defined by the three points. The connected, or cumulant, 3-point function
is $\zeta$, and it is this which is usually simply called the 3-point correlation function in
cosmology. A similar expression (Fry & Peebles 1978) defines the reduced 4-point
function in cosmology. In general, as in (2.119), the $N$-point function is a
combinatoric sum over lower correlations ending with a final, connected, $N$-point term. The
correlation hierarchy and the underlying mathematical details are thoroughly covered
by Fry (1997), Bertschinger (1992), Peebles (1980), and White (1979).

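The third moment of $\delta$ gives

$$ \langle\delta(\mathbf{x})\,\delta(\mathbf{x} + \mathbf{r})\,\delta(\mathbf{x} + \mathbf{s})\rangle = \zeta(r, s, |\mathbf{r} - \mathbf{s}|) , \tag{2.120} $$

and we see that $\zeta$ is the random field analog to the skewness of a probability
distribution function. The 4-point reduced correlation function $\eta$ comes from the
fourth moment of $\delta$,

$$ \eta_{1234} = \langle\delta_1\delta_2\delta_3\delta_4\rangle - \langle\delta_1\delta_2\rangle\langle\delta_3\delta_4\rangle - \langle\delta_1\delta_3\rangle\langle\delta_2\delta_4\rangle - \langle\delta_1\delta_4\rangle\langle\delta_2\delta_3\rangle , \tag{2.121} $$

so $\eta$ is the analog of the excess kurtosis.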

Direct estimations of the $N$-point galaxy correlations, by Peebles and Groth (1975)
for the 3-point and Fry and Peebles (1978) for the 4-point functions, show a hierarchical
form (Fry 1984) whereby higher order correlations are constructed from the 2-point
function. The simplest case, which is observationally robust, is that

$$ \zeta_{123} = Q\left(\xi_{12}\xi_{23} + \xi_{23}\xi_{31} + \xi_{31}\xi_{12}\right) \tag{2.122} $$

for $r \lesssim 3\,h^{-1}$ Mpc, with $Q \approx 1$.

We now consider counts-in-cells statistics, which we relate to integrals over the
correlation functions. Let us randomly place a cell of fixed size ($\approx 8\,h^{-1}$ Mpc in
dimension, for example) and shape repeatedly throughout a galaxy distribution. The
counts $N$ in the cell of volume $V$ will be our random variable. The mean counts per
cell is $\bar N = \bar n_g V$, where $\bar n_g$ is the number density of galaxies. By analyzing the cell
counts in $V$, further chopping $V$ into infinitesimal volumes and using the $\xi$ and $\zeta$
point process definitions (2.115) and (2.119), we obtain the following expressions for
the moments of the $N$ frequency distribution (Peebles 1980),


Apart from the discreteness, or shot noise, terms, we see that the moments of counts
give correlation functions integrated over a volume $V$.

Fourier Spectra

It is often most convenient to work in the Fourier domain. For the transform pair
of a continuous density field, we have

$$ \delta(\mathbf{x}) = \int\frac{d^3k}{(2\pi)^3}\,\delta(\mathbf{k})\,e^{i\mathbf{k}\cdot\mathbf{x}} \tag{2.127} $$
$$ \delta(\mathbf{k}) = \int d^3x\,\delta(\mathbf{x})\,e^{-i\mathbf{k}\cdot\mathbf{x}} . \tag{2.128} $$


The Dirac delta function is represented by

$$ \delta_D(\mathbf{k}) = \int\frac{d^3x}{(2\pi)^3}\,e^{\pm i\mathbf{k}\cdot\mathbf{x}} . \tag{2.129} $$

Since $\delta(\mathbf{x})$ is real (naturally), we have, for the complex $\delta(\mathbf{k})$ field,

$$ \delta(-\mathbf{k}) = \delta(\mathbf{k})^* . \tag{2.130} $$

If $\delta(\mathbf{x})$ is a realization of a homogeneous, isotropic random process with 2-point
spatial correlations $\xi(r) = \langle\delta_1\delta_2\rangle$, then we may compute the second moment of $\delta(\mathbf{k})$ to be

$$ \langle\delta(\mathbf{k}_1)\,\delta(\mathbf{k}_2)\rangle = \left[(2\pi)^3\delta_D(\mathbf{k}_1 + \mathbf{k}_2)\right]P(k_1) , \tag{2.131} $$

where $P(k)$ is the power spectrum, which forms a transform pair with $\xi(r)$,

$$ P(k) = \int d^3r\,\xi(r)\,e^{-i\mathbf{k}\cdot\mathbf{r}} , \qquad \xi(r) = \int\frac{d^3k}{(2\pi)^3}\,P(k)\,e^{i\mathbf{k}\cdot\mathbf{r}} . \tag{2.132} $$

In this second moment calculation, we see that the “momentum conserving” delta
function comes from translation invariance, as $\langle\delta_1\delta_2\rangle$ depends on $\mathbf{r} = \mathbf{x}_1 - \mathbf{x}_2$ only,
and that isotropy (the $|\mathbf{r}|$ dependence of $\xi$) likewise gives the isotropic $k$ dependence
of $P(k)$. Thus we see that an average of $|\delta(\mathbf{k})|^2$ in $k$-space, at fixed $k$, gives the power
spectrum of perturbations, or 2-point correlations, of the random process in the sense

$$ P(k) \sim \langle|\delta(\mathbf{k})|^2\rangle . \tag{2.133} $$

We need to relate the continuous transform analysis to the practical discrete case
which we use later on in calculations and simulations. First, note that there is no
equality in (2.133), as according to these definitions, $P(k)$ is a power spectral density
with units of volume, while $\langle|\delta(\mathbf{k})|^2\rangle$ is the power per mode, with dimensions of volume
squared. In our numerical simulations, and in analyzing simulated or real data, we go
to a discrete case whereby data is contained in a large fundamental cube of volume
$V = L^3$, on which we impose periodic boundary conditions. This simply invokes
the usual Fourier analysis whereby a discrete Fourier transform, corresponding to a
Fourier series, goes over to the continuous Fourier transform in the large $V$ limit. We
shall place an $n \times n \times n$ grid in $x$-space, with grid spacing $\Delta x = L/n$. This grid
remains in place in $k$-space with grid spacing $\Delta k = 2\pi/L$, and there the relation is

$$ \langle|\delta_d(\mathbf{k})|^2\rangle \sim P(k)\,(\Delta k)^3 , \tag{2.134} $$

where the $\delta_d(\mathbf{k})$ in this equation is the discrete transform,

$$ \delta(\mathbf{k}) \sim \delta_d(\mathbf{k})\,(\Delta k)^{-3} . \tag{2.135} $$

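The discrete transform pair is, with $\mathbf{x}$ and $\mathbf{k}$ lying on the $n^3$ grid points,

$$ \delta_d(\mathbf{k}) = \frac{1}{n^3}\sum_{\mathbf{x}} e^{-i\mathbf{k}\cdot\mathbf{x}}\,\delta(\mathbf{x}) \tag{2.136} $$
$$ \delta(\mathbf{x}) = \sum_{\mathbf{k}} e^{i\mathbf{k}\cdot\mathbf{x}}\,\delta_d(\mathbf{k}) . \tag{2.137} $$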
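
A discrete transform pair of this kind maps directly onto a standard FFT. The sketch below assumes the normalization in which the forward transform carries the factor $1/n^3$ and the inverse is a bare sum, and it uses NumPy's `fftn`/`ifftn` (whose own convention puts the $1/n^3$ on the inverse); the function names and the white-noise test field are illustrative only.

```python
import numpy as np


def discrete_transform(delta_x):
    """Forward transform with 1/n^3 normalization: (1/n^3) sum_x e^{-ik.x} delta(x)."""
    return np.fft.fftn(delta_x) / delta_x.size


def inverse_transform(delta_k):
    """Inverse transform as a bare sum over modes: sum_k e^{+ik.x} delta_d(k)."""
    # ifftn already divides by n^3, so multiply it back out.
    return np.fft.ifftn(delta_k).real * delta_k.size


rng = np.random.default_rng(0)
n = 16
delta_x = rng.standard_normal((n, n, n))
delta_x -= delta_x.mean()                     # zero-mean density contrast

delta_k = discrete_transform(delta_x)
recovered = inverse_transform(delta_k)

variance_x = np.mean(delta_x**2)              # <delta^2> averaged over the grid
variance_k = np.sum(np.abs(delta_k)**2)       # sum over modes of the power per mode
# Reality of delta(x) implies delta(-k) = delta(k)^*.
hermitian_ok = np.allclose(delta_k[1, 2, 3], np.conj(delta_k[-1, -2, -3]))
print(variance_x, variance_k, hermitian_ok)
```

With this normalization the sum of $|\delta_d(\mathbf{k})|^2$ over all modes equals the grid-averaged variance, which is the discrete counterpart of integrating the power spectrum to obtain $\xi(0)$.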

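If we wish to make the $\int \leftrightarrow \sum$ correspondence seem more transparent, we may
introduce $V$ (formally, $V = \int d^3x$) into the measure, giving a dimensionless $\delta(\mathbf{k})$, as
follows,

$$ \delta(\mathbf{k}) = \int\frac{d^3x}{V}\,\delta(\mathbf{x})\,e^{-i\mathbf{k}\cdot\mathbf{x}} \tag{2.138} $$
$$ \delta(\mathbf{x}) = \int\frac{V\,d^3k}{(2\pi)^3}\,\delta(\mathbf{k})\,e^{i\mathbf{k}\cdot\mathbf{x}} . \tag{2.139} $$

Then the Dirac delta function (2.129) corresponds to the discrete Kronecker delta,

$$ \int\frac{d^3x}{V}\,e^{\pm i\mathbf{k}\cdot\mathbf{x}} = \frac{(2\pi)^3}{V}\,\delta_D(\mathbf{k}) . \tag{2.140} $$

The second moment is then

$$ \langle\delta(\mathbf{k}_1)\,\delta(\mathbf{k}_2)\rangle = \left[\frac{(2\pi)^3}{V}\,\delta_D(\mathbf{k}_1 + \mathbf{k}_2)\right]P(k_1) , \tag{2.141} $$

where the $[\ldots]$ term is 0 or 1 and the notation

$$ P(k) = \langle|\delta(\mathbf{k})|^2\rangle \tag{2.142} $$
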


as  used  in  this  equation  refers  to  a dimensionless  power  per  mode  which  changes  (as 
it  should  not  if  it  were  a true  power  spectrum)  with  the  grid  size.  The  correspondence 
with  a true  power  spectrum  is  understood,  and  anyway  nothing  is  really  wrong,  since 
an  overall  normalization  will  fix  the  scale  (if  one  changes  grid  size  for  example).  This 
might  be  a little  sloppy  but  it  is  useful.  There  is  really  no  problem  as  long  as  one  is 
thinking in terms of one or the other (either continuous or discrete) during a calculation.
We go ahead with the notation in (2.142). As we see beginning in Chapter 4,
it  is  this  P(k)  which  is  the  immediate  raw  data  (apart  from  possible  discreteness  and 
selection  function  effects)  from  a discrete  Fourier  analysis  of  data.  The  definitions  of 
the  power  spectrum  and  related  quantities  are  clarified  by  Bertschinger  (1992). 

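From the Fourier transform pair (2.138) and (2.139), we see that

$$ \sigma^2 = \langle\delta^2(\mathbf{x})\rangle = V\int\frac{d^3k}{(2\pi)^3}\,P(k) = \int\frac{dk}{k}\,\frac{V k^3 P(k)}{2\pi^2} , \tag{2.143} $$

and so the contribution to the local variance $\sigma^2$ from the interval $d\ln k = dk/k \sim 1$ is
$\Delta^2(k) = V k^3 P(k)/2\pi^2$. Furthermore, the variance $\langle\delta^2\rangle_R$ at length scale $R \sim k^{-1}$ is
$\sim k^3 P(k)$, which shows part of the intuitive appeal of $P(k)$ as a description. This can
be seen from filtering considerations. Define a window function $W$ with an effective
width $R$ in $x$-space, and let $\delta_W$ be the smoothed, or convolved, field:

$$ \delta_W(\mathbf{x}) = \int d^3x'\,\delta(\mathbf{x}')\,W(\mathbf{x} - \mathbf{x}') . \tag{2.144} $$

For example, we have the $x$-space top hat

$$ W(\mathbf{x}) = \frac{3}{4\pi R^3} \quad (r < R) , \qquad W(k) = \frac{3}{y^3}\left(\sin y - y\cos y\right) , \quad y = kR , \tag{2.145} $$
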


and we could use a Gaussian filter to minimize the high frequency sharp edge effects,
if need be. The filtered moment works out as

$$ \sigma_W^2 = \langle\delta_W(\mathbf{x})\,\delta_W(\mathbf{x})\rangle = V\int\frac{d^3k}{(2\pi)^3}\,P(k)\,|W(k)|^2 , \tag{2.146} $$

and from this we have $\sigma_R \sim \Delta(k = R^{-1})$, though this depends on the form of $P(k)$,
e.g. for $P \sim k^n$, we need $-3 < n < 1$ (cf. Kolb & Turner 1990).
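
The scaling of the filtered variance with the smoothing radius can be illustrated numerically. The sketch below assumes a pure power law written as $\Delta^2(k) = \mathrm{amplitude}\times k^{n+3}$, the top hat $W(kR)$ given above, and a simple trapezoid rule on a finite $k$ range; the function names, the cutoff, and the sample count are illustrative choices only.

```python
import numpy as np


def tophat_window(y):
    """Fourier transform of the real-space top hat, W = 3 (sin y - y cos y) / y^3."""
    y = np.asarray(y, dtype=float)
    small = np.abs(y) < 1e-3
    safe = np.where(small, 1.0, y)            # avoid 0/0 at the origin
    exact = 3.0 * (np.sin(safe) - safe * np.cos(safe)) / safe**3
    series = 1.0 - y**2 / 10.0                # leading terms of the expansion for small y
    return np.where(small, series, exact)


def filtered_variance(n, radius, amplitude=1.0, k_max_factor=200.0, samples=200001):
    """sigma_R^2 = int dk/k Delta^2(k) |W(kR)|^2 with Delta^2 = amplitude * k^(n+3)."""
    k = np.linspace(0.0, k_max_factor / radius, samples)[1:]
    integrand = amplitude * k**(n + 2) * tophat_window(k * radius)**2
    return np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(k))


ratio = filtered_variance(0.0, 2.0) / filtered_variance(0.0, 1.0)
print("sigma^2(2R) / sigma^2(R) for n = 0:", ratio)
```

For a power law the result scales as $R^{-(n+3)}$, the same scaling as $\Delta^2(k = R^{-1})$, provided the integral converges at both ends.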

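The bispectrum and trispectrum (Peebles 1980, Baumgart and Fry 1991) can
be defined from the third and fourth moments in $k$-space, analogously to equations
(2.141) and (2.142). We now compute the third $k$-space moment $\langle\delta(\mathbf{k}_1)\delta(\mathbf{k}_2)\delta(\mathbf{k}_3)\rangle$,
clarifying the relation between the bispectrum and the spatial 3-point function.

$$ \begin{aligned}
\langle\delta(\mathbf{k}_1)\delta(\mathbf{k}_2)\delta(\mathbf{k}_3)\rangle
 &= \int\frac{d^3x_1}{V}\frac{d^3x_2}{V}\frac{d^3x_3}{V}\,\langle\delta(\mathbf{x}_1)\delta(\mathbf{x}_2)\delta(\mathbf{x}_3)\rangle\,e^{-i(\mathbf{k}_1\cdot\mathbf{x}_1 + \mathbf{k}_2\cdot\mathbf{x}_2 + \mathbf{k}_3\cdot\mathbf{x}_3)} \\
 &= [\delta_D]\int\frac{d^3r}{V}\frac{d^3s}{V}\,\zeta(r, s, |\mathbf{r} - \mathbf{s}|)\,e^{-i(\mathbf{k}_2\cdot\mathbf{r} + \mathbf{k}_3\cdot\mathbf{s})} \\
 &= [\delta_D]\,B(\mathbf{k}_1, \mathbf{k}_2, \mathbf{k}_3) .
\end{aligned} \tag{2.147} $$

In the second step we changed coordinates $(\mathbf{x}_1, \mathbf{x}_2, \mathbf{x}_3) \to (\mathbf{x}_1, \mathbf{x}_1 + \mathbf{r}, \mathbf{x}_1 + \mathbf{s})$ and
utilized the homogeneity of $\zeta$. The notation $[\delta_D]$ refers to the momentum conserving
delta function

$$ [\delta_D] = \left[\frac{(2\pi)^3}{V}\,\delta_D\Big(\sum_i\mathbf{k}_i\Big)\right] , \tag{2.148} $$

which in this case gives 1 for closed triangles on the $k$ grid and gives 0 otherwise.
Thus we have the convenient notation, similar to (2.142), which says

$$ B(\mathbf{k}_1, \mathbf{k}_2, \mathbf{k}_3) = \langle\delta(\mathbf{k}_1)\delta(\mathbf{k}_2)\delta(\mathbf{k}_3)\rangle\Big|_{\sum\mathbf{k}_i = 0} . \tag{2.149} $$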

The hierarchical form for $\zeta_{123}$, equation (2.122), goes over in $k$-space to (Fry &
Seldner 1982)

$$ B_{123} = Q\left(P_1 P_2 + P_2 P_3 + P_3 P_1\right) \tag{2.150} $$

with the same $Q$ value, if $Q$ is constant. In (2.150), we have $\mathbf{k}_1 + \mathbf{k}_2 + \mathbf{k}_3 = 0$, and
$P_i = P(k_i)$. In practice, the power $P$ and bispectrum $B$ are useful on both small
($r \lesssim 10\,h^{-1}$ Mpc) and large scales, whereas $\xi$ and $\zeta$ estimators, based on the idea of
equation (2.116), have severe signal to noise problems on large scales. It is useful to
define, based on (2.150), a reduced bispectrum from the ratio

$$ Q = \frac{B_{123}}{P_1 P_2 + P_2 P_3 + P_3 P_1} . \tag{2.151} $$

This proves useful in that several orders of magnitude in $B$ can be reduced to $Q$
variations on the order of 1. Furthermore, perturbation theory predictions for $Q$
reduce to (2.150) in a special case ($P \sim k^n$, $k_1 = k_2 = k_3$), and in general the $Q$
functional dependence from perturbation theory pertains to large-scale measurements,
which confirms that gravitational instability is in effect.
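
The reduction of a wide dynamic range in $B$ to an order-unity $Q$ can be illustrated with a few lines of code. The sketch below assumes a toy power law $P \sim k^{-1.5}$ and a bispectrum built by hand in the hierarchical form with a chosen constant; the function names, the spectral index, and the triangle configurations are hypothetical and serve only as an illustration.

```python
import numpy as np


def reduced_bispectrum(b123, p1, p2, p3):
    """Q = B_123 / (P1 P2 + P2 P3 + P3 P1) for a closed triangle of wavevectors."""
    return b123 / (p1 * p2 + p2 * p3 + p3 * p1)


def closes(k1, k2, k3, tol=1e-12):
    """True if the three wavevectors sum to zero (a closed triangle)."""
    return bool(np.all(np.abs(np.asarray(k1) + np.asarray(k2) + np.asarray(k3)) < tol))


def toy_power(k, index=-1.5):
    """Hypothetical power law spectrum P ~ k^index."""
    return np.linalg.norm(k)**index


q_input = 1.0
triangles = [
    (np.array([1.0, 0.0, 0.0]), np.array([0.0, 1.0, 0.0])),
    (np.array([10.0, 0.0, 0.0]), np.array([-5.0, 8.0, 0.0])),
    (np.array([100.0, 3.0, 0.0]), np.array([-40.0, 0.0, 70.0])),
]

q_values, b_values = [], []
for k1, k2 in triangles:
    k3 = -(k1 + k2)                       # enforce k1 + k2 + k3 = 0
    assert closes(k1, k2, k3)
    p1, p2, p3 = toy_power(k1), toy_power(k2), toy_power(k3)
    b123 = q_input * (p1 * p2 + p2 * p3 + p3 * p1)
    b_values.append(b123)
    q_values.append(reduced_bispectrum(b123, p1, p2, p3))

print("B spans", min(b_values), "to", max(b_values))
print("Q values:", q_values)
```

In this toy case $B$ varies by several orders of magnitude across the triangles while the recovered $Q$ stays at the input value.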


It is conventional to assume that the primordial density fluctuations are a
homogeneous Gaussian random field. Such a field is completely specified by its 2-point
correlations, as the connected $N$-point correlation functions are all zero for $N > 2$.
One can argue, roughly speaking, that the validity of this assumption is a natural
consequence of early fluctuations representing noise with random phases in Fourier
space, invoking the central limit theorem (Zel'dovich & Novikov 1983).

With the Gaussian random field assumption, it follows, for example, that the
one-point probability distribution $P$ for the initial $\delta(\mathbf{x})$ field is Gaussian,

$$ P(\delta)\,d\delta = \frac{1}{\sqrt{2\pi\sigma^2}}\exp\left(-\frac{\delta^2}{2\sigma^2}\right)d\delta \tag{2.152} $$
$$ \sigma^2 = \xi(0) = \langle\delta(\mathbf{x})\,\delta(\mathbf{x})\rangle , $$

where $\sigma \ll 1$ is the same at all points. In fact, the random field description involves
a probability functional $P[\delta(\mathbf{x})]$, and its related moment generating functional, as
explained by Fry (1997), Borgani (1995), and Bertschinger (1992). The homogeneity
and Gaussianity assumptions simplify matters greatly. We rely heavily on these
simplifications, especially in the numerical simulations we present below. Thus we
have that the $N$-point probability distribution for $\delta_1 \ldots \delta_N$ is Gaussian with covariance
function $\xi(r_{ij})$, with all higher connected moments vanishing, in analogy with
the single variable Gaussian distribution. In particular, the covariance function is
diagonal in $k$-space, and so the components of $\delta(\mathbf{k})$ are distributed normally,

$$ P(\delta)\,d\delta = \frac{1}{\sqrt{2\pi\sigma_k^2}}\exp\left(-\frac{\delta^2}{2\sigma_k^2}\right)d\delta \tag{2.153} $$
$$ \sigma_k^2 = P(k)/2 , $$


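where $\delta$ stands for both the real and imaginary parts of $\delta(\mathbf{k})$. The power per mode
is denoted by $P(k)$, as in equation (2.142). In terms of modulus and phase, this
says that the phases are uniformly random and that the modulus $|\delta(\mathbf{k})|$ is Rayleigh
distributed,

$$ \delta(\mathbf{k}) = |\delta(\mathbf{k})|\, e^{i\theta} $$

$$ P(|\delta|)\,d|\delta| = \frac{2|\delta|}{P(k)} \exp\left(-\frac{|\delta|^2}{P(k)}\right) d|\delta| , \qquad (2.154) $$

where the mean and standard deviation of the $|\delta(\mathbf{k})|$ distribution are

$$ \langle |\delta(\mathbf{k})| \rangle = \frac{\sqrt{\pi P(k)}}{2} , \qquad \sigma_{|\delta(\mathbf{k})|} = \sqrt{\left(1 - \frac{\pi}{4}\right) P(k)} . \qquad (2.155) $$

The Rayleigh distribution (2.154) is obtained by treating the random variable $|\delta(\mathbf{k})|$
as derived from the independent real and imaginary parts of $\delta(\mathbf{k})$. Similarly, the
distribution for $|\delta(\mathbf{k})|^2$ is exponential,

$$ P(|\delta|^2)\,d|\delta|^2 = \frac{1}{P(k)} \exp\left(-\frac{|\delta|^2}{P(k)}\right) d|\delta|^2 , \qquad (2.156) $$

with mean and standard deviation given by

$$ \langle |\delta(\mathbf{k})|^2 \rangle = P(k) , \qquad \sigma_{|\delta(\mathbf{k})|^2} = P(k) . \qquad (2.157) $$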
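The statement that Gaussian real and imaginary parts are equivalent to a Rayleigh modulus with a uniform phase can be checked numerically. The following is a minimal sketch, not the procedure used for the simulations in this work: the function names, the seed, the sample size, and the value of the power per mode are all illustrative assumptions. It draws modes both ways and compares the sample moments of $|\delta(\mathbf{k})|$ with $\sqrt{\pi P}/2$ and $P$.

```python
import math
import random

def sample_mode(power, rng):
    """Draw delta(k) from independent Gaussian real and imaginary parts."""
    sigma = math.sqrt(power / 2.0)  # each part has variance P(k)/2
    return complex(rng.gauss(0.0, sigma), rng.gauss(0.0, sigma))

def sample_mode_polar(power, rng):
    """Draw delta(k) from a Rayleigh modulus and a uniform phase."""
    # Inverting the Rayleigh CDF, 1 - exp(-r^2/P), gives r = sqrt(-P ln(1 - U)).
    modulus = math.sqrt(-power * math.log(1.0 - rng.random()))
    phase = 2.0 * math.pi * rng.random()
    return complex(modulus * math.cos(phase), modulus * math.sin(phase))

def modulus_moments(samples):
    """Return (<|delta|>, <|delta|^2>) estimated from a list of samples."""
    n = len(samples)
    return (sum(abs(z) for z in samples) / n,
            sum(abs(z) ** 2 for z in samples) / n)

rng = random.Random(1997)   # fixed seed (arbitrary) so the sketch is repeatable
P = 2.5                     # illustrative power per mode
cartesian = modulus_moments([sample_mode(P, rng) for _ in range(100000)])
polar = modulus_moments([sample_mode_polar(P, rng) for _ in range(100000)])
expected_mean = math.sqrt(math.pi * P) / 2.0
print(cartesian, polar, (expected_mean, P))
```

Both sampling routes should agree with the analytic moments to within the statistical scatter of the sample, which is the practical content of the equivalence between the two descriptions.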



Primordial power is conveniently parametrized as a featureless power law, $P(k) \sim k^n$.
The Zel'dovich scale-free spectrum (Harrison 1970, Peebles & Yu 1970, Zel'dovich
1972), which has spectral index $n = 1$, is favored theoretically. Also, inflationary
theories predict $P \sim k$ fairly robustly. Using the COBE measurements to tie down
the large-scale fluctuation spectrum indicates that $n = 1$ is likely (Gorski et al. 1994).

The Zel'dovich spectrum is scale-free in the sense that gravitational potential
fluctuations have equal amplitude on all scales (Bertschinger 1993). The potential
fluctuations on a physical scale are $\sim k^3 |\phi_k|^2$. The scale-free condition requires a
potential spectrum $|\phi_k|^2 \sim k^{-3}$, but $\phi_k \sim \delta_k/k^2$ from the Poisson equation, and thus we
need $|\delta_k|^2 \sim k$. In the language of density fluctuations, the interpretation of $P(k) \sim k$
as scale-free refers to the following fact. A mean square fluctuation $\sim k^3 P(k)$ of
physical scale $k^{-1}$, growing according to superhorizon perturbation theory, has a
constant ($\approx 10^{-5}$) amplitude as the wavelength enters the horizon ($d_H \sim t$), for all scales.

While fluctuations are evolving early in the linear regime, with $z > z_{eq}$ (the
matter-radiation equality epoch is $z_{eq} \approx 10^4$), physical processes modify the
primordial spectrum prior to the onset of structure formation. This is codified in a
linear transfer function $T$ which describes evolution from some primordial time $t_p$ to
some later time $t_i$, which acts as the initial time for the dynamical evolution in our
equations of motion. Recalling that $\delta \sim A(t)$ in linear theory, we connect the early
power spectrum $P(k, t_p)$ to the spectrum at $t_i$, corresponding to $z \approx 1000$, with $T^2$,

$$ P(k, t_i) = \left[\frac{A(t_i)}{A(t_p)}\right]^2 T^2(k, t_i)\, P(k, t_p) . \qquad (2.158) $$

For physical wavelengths $\gg l_{eq}$, the horizon at $t_{eq}$, the slope of $P(k)$ (at small $k$)
is unaffected. The shape of $P(k)$ for lengths $k^{-1} \lesssim l_{eq}$ (large $k$) is affected, and
the details depend on the particular model being used. Transfer functions for various
models are reviewed by Efstathiou (1990). A convenient numerical fit from the original
CDM paper (Peebles 1982) is

$$ T^2 = (1 + \alpha k + \beta k^2)^{-2} , \qquad (2.159) $$

where the numerical factors are provided by Peebles (1982). There was an onslaught
of CDM and, for a while, HDM papers in the 1980s. Some numerical fits to the
transfer functions for such models are provided in Bardeen et al. (1986).

The CDM function in (2.159) shows the typical bend,

$$ P(k) \sim k \quad (k \ll k_{eq}) , \qquad P(k) \sim k^{-3} \quad (k \gg k_{eq}) , \qquad (2.160) $$

which shows the preferred length scale $k_{eq}^{-1} \approx 10\,(\Omega_0 h^2)^{-1}$ Mpc, the horizon at
matter-radiation equality, though the transition from $k$ to $k^{-3}$ is gradual. This transition
from a primordial $P \sim k^n$ to $P \sim k^{n-4}$ follows from the fact that fluctuations of
non-relativistic (cold) matter which enter the horizon at $z > z_{eq}$ practically do not
grow until $z \lesssim z_{eq}$, since the subhorizon perturbations in the dominant radiation
oscillate like an acoustic wave (Peebles 1980).
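The gradual bend from $P \sim k^n$ to $P \sim k^{n-4}$ can be seen by evaluating the logarithmic slope of $k^n T^2$ with $T^2 = (1 + \alpha k + \beta k^2)^{-2}$. The sketch below is illustrative only. The coefficient values $\alpha = 6\,h^{-2}$ Mpc and $\beta = 2.65\,h^{-4}$ Mpc$^2$ are quoted from memory of the Peebles (1982) fit for $\Omega = 1$ and should be treated as an assumption to be checked against that paper; the function names are hypothetical.

```python
import math

# Assumed fit coefficients (quoted from memory of Peebles 1982, Omega = 1);
# verify against the original paper before relying on them.
ALPHA = 6.0    # h^-2 Mpc
BETA = 2.65    # h^-4 Mpc^2

def transfer_squared(k, h=1.0):
    """T^2 = (1 + alpha k + beta k^2)^-2, with k in Mpc^-1."""
    alpha = ALPHA / h ** 2
    beta = BETA / h ** 4
    return (1.0 + alpha * k + beta * k * k) ** -2

def power(k, n=1.0, h=1.0):
    """Unnormalized linear spectrum k^n T^2(k)."""
    return k ** n * transfer_squared(k, h)

def log_slope(k, n=1.0, h=1.0, eps=1e-3):
    """Centered finite-difference estimate of d ln P / d ln k."""
    lo, hi = k * (1.0 - eps), k * (1.0 + eps)
    return (math.log(power(hi, n, h)) - math.log(power(lo, n, h))) / (
        math.log(hi) - math.log(lo))

slope_small_k = log_slope(1e-5)   # deep in the primordial regime
slope_large_k = log_slope(1e4)    # deep in the processed regime
slope_bend = log_slope(0.1)       # in the transition region
print(slope_small_k, slope_bend, slope_large_k)
```

The slope at intermediate $k$ lies between the two asymptotic values, which is what "gradual" means in practice for this fit.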

The HDM transfer function (Bond & Szalay 1983) is

$$ T \approx 10^{-(k/k_\nu)^{1.5}} , \qquad (2.161) $$

corresponding to a universe dominated by collisionless neutrinos. The free streaming
damping scale is $13\,(\Omega_\nu h^2)^{-1}$ Mpc, which is $\approx 40$ Mpc for a 30 eV neutrino. Small
scales are washed out, and large structures form first, in what would be the most
direct application of Zel'dovich pancake formation. There are problems in timing
(galaxies would form too late) and $\Delta T/T$ (cosmic background radiation fluctuations
would need to be too large) with the simplest HDM models.
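As a quick arithmetic check of the quoted damping scale, the sketch below evaluates $13\,(\Omega_\nu h^2)^{-1}$ Mpc for a given neutrino mass. The conversion $\Omega_\nu h^2 \approx m_\nu / 93$ eV is a standard relation that is not stated in the text and is an assumption here, as are the function names. The transfer function is written in the form $T = 10^{-(k/k_\nu)^{1.5}}$ with $k_\nu$ left as a free parameter.

```python
def omega_nu_h2(m_nu_ev, ev_per_unit_density=93.0):
    """Neutrino density parameter times h^2 (assumed relation m_nu / 93 eV)."""
    return m_nu_ev / ev_per_unit_density

def free_streaming_scale_mpc(m_nu_ev):
    """Damping scale 13 / (Omega_nu h^2) in Mpc."""
    return 13.0 / omega_nu_h2(m_nu_ev)

def hdm_transfer(k, k_nu):
    """HDM transfer function T = 10^(-(k/k_nu)^1.5)."""
    return 10.0 ** (-(k / k_nu) ** 1.5)

scale_30ev = free_streaming_scale_mpc(30.0)
print(scale_30ev, hdm_transfer(0.5, 1.0), hdm_transfer(2.0, 1.0))
```

With these assumptions a 30 eV neutrino gives a scale close to 40 Mpc, and the transfer function falls by a factor of ten at $k = k_\nu$ and far more steeply beyond it.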


The initial, linear regime $P(k)$ can be normalized in various ways, as reviewed by
Bertschinger (1992). Two ways are provided by Peebles (1982). Large-scale
normalization (at scales $\approx 10^4 h^{-1}$ Mpc) is based on matching the small $\langle\delta^2\rangle_R$ fluctuations
at the horizon with COBE quadrupole anisotropies. Matching $\sigma_8^2$, defined to be the
$\langle\delta^2\rangle_R$ at $R = 8\,h^{-1}$ Mpc, with $\sigma_8^2 \approx 1$ from observations is a smaller scale
normalization; this method uses equation (2.146). These two normalizations are equivalent if
the linear $P(k)$ accurately spans such scales.

The last point which we mention here concerns galaxies, which form the observational
basis of our subject. We must assume (rather cavalierly perhaps, because we
know it is an oversimplification) that the present observable distribution of galaxies
is directly related to the random field $\delta(\mathbf{x})$. Discreteness of the sample complicates
matters, but this is not a fundamental problem for the model (homogeneous random
process) that we use. The problem, which we mentioned at the outset, is that so much
physics (galaxy formation, galaxy statistics) must be ignored in the assumption.
Assuming that we have a handle on $\delta(\mathbf{x})$, we may use the equations of motion developed
above to evolve the density (and velocity and potential) forward or backward in time.

That said, there is one last complication which we may at least try to model
simply. This concerns the relation between $\delta(\mathbf{x})$ and $\delta_g(\mathbf{x})$, the matter and galaxy
distributions. If there is no relation, then physicists would need to go back to the
drawing board. Possibly, galaxies faithfully trace the mass, so that a smoothed $\delta_g$
field would really correspond to $\delta$. At the least, there should be some functional
relationship, and if the range of influence is not large, then we would have a local bias
(Coles 1992, Kaiser 1984), with $\delta_g(\mathbf{x}) = f(\delta(\mathbf{x}))$. The simplest case is linear bias,
where fluctuations are linearly stretched, $\delta_g = b\,\delta$.




2.4 Perturbation Theory for Statistics


We now turn to some of the higher order perturbation theory results. The linear
solutions, introduced in §2.2.4, are of obvious importance, as they represent the leading
order term, which is the largest when $\langle\delta^2\rangle \ll 1$. As such they describe the initial
deviations from uniformity.


Going to second or third order gives results which are surprisingly useful in analysing
both $N$-body simulations and actual galaxy survey data. This might be surprising
because the higher order solutions are describing the development of nonlinearities.


The linear density solution,

$$ \delta^{(1)}(\mathbf{x},t) = A(t)\,\delta_0(\mathbf{x}) , \qquad \delta_0(\mathbf{x}) = \delta(\mathbf{x},t_i) , \qquad (2.162) $$

is modified by the addition of higher order terms in the perturbation series, e.g. of
order $\sim (\delta^{(1)})^2$, which are nonlocal and, naturally, change the shape of the initial $\delta_0$.
But as the higher order terms become important, as $\delta \approx 1$, all the higher order terms
are large, and so the usefulness of keeping just the second and third order terms is not
obvious a priori. Finally, the convergence of a perturbation series expansion has not
been proven. Despite those grievances, it is fair to say that higher order perturbation
theory is quite relevant in what one might call the quasi-linear regime (Fry 1984, Jain
& Bertschinger 1994).

We utilize the formalism and notation developed by Peebles (1980), Fry (1984),
and Goroff et al. (1986). This is sometimes referred to as Eulerian perturbation theory,
since  we  use  comoving  spatial  coordinates.  Thus  the  distinction  is  made  between 
this  and  Lagrangian  perturbation  theory,  which  uses  comoving  material  coordinates 
(which  follow  particle  trajectories). 

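The fundamental equation, developed in §2.3.3, is

$$ \frac{\partial^2\delta}{\partial t^2} + 2\frac{\dot a}{a}\frac{\partial\delta}{\partial t} - 4\pi G\bar\rho\,\delta = 4\pi G\bar\rho\,\delta^2 + \frac{1}{a^2}\nabla\delta\cdot\nabla\phi + \frac{1}{a^2}\nabla_i\nabla_j\left[(1+\delta)\,v^i v^j\right] . \qquad (2.163) $$

Ignoring the entire right hand side, which contains higher order terms, we get $\delta^{(1)}$ as in
(2.162), the (rescaled) gravitational potential $\Delta^{(1)}$, which is the solution to $\nabla^2\Delta^{(1)} = \delta^{(1)}$,
and the velocity $\mathbf{v}^{(1)} = -a\,(\dot A/A)\,\nabla\Delta^{(1)}$.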

We write the perturbation series as

$$ \delta = \delta^{(1)} + \delta^{(2)} + \delta^{(3)} + \ldots , \qquad (2.164) $$

where the higher terms are of higher order in smallness (for $\delta \ll 1$). In particular, the
iterative solutions show that $\delta^{(n)} \sim A^n\,O(\delta_0^n)$. For $\Omega = 1$, using the growing mode
$A \sim t^{2/3}$, the solutions are exactly of the form

$$ \delta(\mathbf{x},t) = \sum_{n=1}^{\infty} A^n(t)\,\delta_n(\mathbf{x}) \qquad (2.165) $$

as shown by Goroff et al. (1986). Results were obtained by Bouchet et al. (1992) for
arbitrary $\Omega$, and for $\Omega \approx 1$ the series is nearly of the form (2.165), with very weak $\Omega$
dependence.

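To obtain the differential equation for $\delta^{(2)}$, we use the linear results, in terms
of $\delta^{(1)}$ and $\Delta^{(1)}$, on the right hand side of equation (2.163), keeping terms of order
$(\delta^{(1)})^2$; on the left hand side we use $\delta^{(2)}$, as per a standard iterative scheme. In the
$\Omega = 1$ case, this leads to the following second order solution,

$$ \delta^{(2)} = \frac{5}{7}\left(\delta^{(1)}\right)^2 + \delta^{(1)}_{,i}\,\Delta^{(1)}_{,i} + \frac{2}{7}\,\Delta^{(1)}_{,ij}\,\Delta^{(1)}_{,ij} , \qquad (2.166) $$

where $f_{,i} = \partial f/\partial x^i$. This follows the time dependence as shown in (2.165) and also
has $\langle\delta^{(2)}\rangle = 0$, as expected.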
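A single plane wave gives a compact check of the second order solution. The sketch below assumes the standard form $\delta^{(2)} = \frac{5}{7}(\delta^{(1)})^2 + \delta^{(1)}_{,i}\Delta^{(1)}_{,i} + \frac{2}{7}\Delta^{(1)}_{,ij}\Delta^{(1)}_{,ij}$ with $\nabla^2\Delta^{(1)} = \delta^{(1)}$, restricted to one dimension with $\delta^{(1)} = \epsilon\cos kx$ at a time when $A = 1$. The one-dimensional restriction, the parameter values, and the function name are illustrative assumptions.

```python
import math

def second_order_plane_wave(eps, k, x):
    """Second order density for the 1D linear field delta1 = eps*cos(k x)."""
    delta1 = eps * math.cos(k * x)
    grad_delta1 = -eps * k * math.sin(k * x)   # d(delta1)/dx
    grad_pot = eps * math.sin(k * x) / k       # d(Delta1)/dx, from Delta1'' = delta1
    hess_pot = delta1                          # Delta1,xx = delta1 in one dimension
    return ((5.0 / 7.0) * delta1 ** 2
            + grad_delta1 * grad_pot
            + (2.0 / 7.0) * hess_pot ** 2)

eps, k, n_grid = 0.1, 3.0, 256
xs = [2.0 * math.pi / k * i / n_grid for i in range(n_grid)]  # one full period
delta2 = [second_order_plane_wave(eps, k, x) for x in xs]
mean_delta2 = sum(delta2) / n_grid
max_error = max(abs(d - eps ** 2 * math.cos(2.0 * k * x))
                for d, x in zip(delta2, xs))
print(mean_delta2, max_error)
```

In this restricted case the three terms combine to $\epsilon^2\cos 2kx$: the spatial mean vanishes, and the second order field lives entirely at the harmonic $2k$, which is absent from the initial field.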

As an illustrative application, we may compute the evolution of the third moment
$\langle\delta^3(\mathbf{x},t)\rangle$ from Gaussian initial conditions (where all connected moments of $\delta_0$ above
the 2-point function vanish). We assume that $\Omega = 1$. To leading nonvanishing order,
we have (recall that $\delta^{(1)}$ is a Gaussian field whose third moment vanishes)

$$ \langle\delta^3\rangle = \left\langle\left(\delta^{(1)} + \delta^{(2)} + \ldots\right)^3\right\rangle = 3\left\langle\delta^{(1)2}\,\delta^{(2)}\right\rangle = \frac{34}{7}\left\langle\delta^{(1)2}\right\rangle^2 . \qquad (2.167) $$

This result, based on (2.166), is shown by Peebles (1980). Details are also provided
by Fry and Scherrer (1993), who generalise to a case with non-Gaussian initial
conditions. The local skewness parameter, $S_3 = \langle\delta^3\rangle/\langle\delta^2\rangle^2$, is thus independent of time
and the form of the initial 2-point correlations. If we consider the initially Gaussian
distribution function for the $\delta$ values, we may ask how the skewness (in the usual
sense of the word) of the distribution evolves. This normalized skewness is

$$ \frac{\langle\delta^3\rangle}{\langle\delta^2\rangle^{3/2}} = \frac{34}{7}\,\langle\delta^2\rangle^{1/2} , \qquad (2.168) $$

and it increases with time as $A(t)$, to lowest order.

Further hierarchical amplitudes $S_N$ coming from the connected $N$-point correlation
function average $\langle\delta^N\rangle_c$ can be similarly defined, $\langle\delta^N\rangle_c = S_N\langle\delta^2\rangle^{N-1}$, and the
$S_N$ are constant in the Einstein-de Sitter case, as in (2.167), and largely independent
of cosmological parameters in general (Goroff et al. 1986, Bernardeau 1992). For
a possible relation to observation, we need to use the smoothed density field $\delta_R$,
as defined in equation (2.144). In practice, the moments of counts in cells, as in
equations (2.123-2.126), are related to volume averaged correlation functions (apart
from discreteness corrections), and the hierarchical scaling parametrized by $S_N$ is an
observable effect. Smoothing at scale $R$, the third moment of $\delta_R$ gives

$$ S_3 = \frac{34}{7} + \gamma_1 , \qquad \gamma_1 = \frac{d\log\langle\delta^2\rangle_R}{d\log R} , \qquad (2.169) $$

where $\gamma_1$ is scale dependent. The $\gamma_1$ term depends on the shape of $P(k)$ and the shape
of the filter, in general. For initial power $P \sim k^n$, with a top hat filter as in (2.145),
we have $\langle\delta^2\rangle_R \sim R^{-(n+3)}$, which gives the rather nice result that $S_3 = 34/7 - (n+3)$.
Bernardeau (1994) includes this and higher order results.
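The scaling $\langle\delta^2\rangle_R \sim R^{-(n+3)}$ for a power law spectrum and a top hat filter can be verified numerically, giving the logarithmic slope $\gamma_1 = d\log\langle\delta^2\rangle_R / d\log R$ and hence $S_3$. The sketch below assumes the usual form $\langle\delta^2\rangle_R \propto \int d\ln k\; k^3 P(k)\, W^2(kR)$ with the top hat window $W(x) = 3(\sin x - x\cos x)/x^3$; this form is not quoted in the passage above, and the integration limits, step count, and names are illustrative choices. The integral converges only for $-3 < n < 1$.

```python
import math

def tophat_window(x):
    """Fourier transform of a spherical top hat, W(x) = 3(sin x - x cos x)/x^3."""
    if x < 1e-3:
        return 1.0 - x * x / 10.0   # series expansion avoids cancellation
    return 3.0 * (math.sin(x) - x * math.cos(x)) / x ** 3

def smoothed_variance(radius, n, kmin=1e-4, kmax=1e3, steps=20000):
    """<delta^2>_R up to a constant factor for P(k) = k^n (midpoint rule in ln k)."""
    dlnk = (math.log(kmax) - math.log(kmin)) / steps
    total = 0.0
    for i in range(steps):
        k = kmin * math.exp((i + 0.5) * dlnk)
        total += k ** (n + 3) * tophat_window(k * radius) ** 2 * dlnk
    return total

def gamma_1(n, r1=1.0, r2=2.0):
    """Finite-difference estimate of d log <delta^2>_R / d log R."""
    return (math.log(smoothed_variance(r2, n)) -
            math.log(smoothed_variance(r1, n))) / math.log(r2 / r1)

n_index = -1.0
gamma = gamma_1(n_index)
s3 = 34.0 / 7.0 + gamma
print(gamma, s3)
```

For $n = -1$ the estimate of $\gamma_1$ should be close to $-2$, so that the smoothed skewness parameter is reduced from $34/7$ to about $20/7$.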

We focus on the situation in Fourier transform space, as introduced in §2.3.2,
where comoving coordinates $\mathbf{x}$ and wavevectors $\mathbf{k}$ describe the complementary
domains. Linear perturbation theory, viewed in the Fourier domain, says that
fluctuations at different wavelengths evolve independently. Nonlinear, nonlocal effects,
represented to lowest nonvanishing order by equation (2.166), are seen as mode
coupling in $\mathbf{k}$-space.

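Consider the full $\delta(\mathbf{x},t)$ equation (2.163) viewed in Fourier space. Considering
the lowest order nonlinear effects of order $(\delta^{(1)})^2$, we would have an equation of the
following form for $\delta(\mathbf{k},t) = \delta_k$,

$$ \frac{\partial^2\delta_k}{\partial t^2} + 2\frac{\dot a}{a}\frac{\partial\delta_k}{\partial t} - 4\pi G\bar\rho\,\delta_k = \int d^3k'\,d^3k''\;\delta_{k'}\,\delta_{k''}\,F(\mathbf{k},\mathbf{k}',\mathbf{k}'')\,\delta_D(\mathbf{k}-\mathbf{k}'-\mathbf{k}'') . \qquad (2.170) $$

For example, this generates harmonics for $k$ values which are not even present in the
initial $P(k)$. In general, the evolution of a mode $\mathbf{k}$ is determined by coupling of all
pairs of modes $\mathbf{k}'$ and $\mathbf{k}''$ with $\mathbf{k}' + \mathbf{k}'' = \mathbf{k}$. The sum $\delta_k = \delta_k^{(1)} + \delta_k^{(2)}$
represents perturbations up to second order, in that $\delta^{(1)}(\mathbf{k},t)$ is the solution to the
equation with the right hand side zero, and $\delta^{(2)}$ comes from two interacting modes.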

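In particular, for $\Omega = 1$, with $t^{-2} = 6\pi G\bar\rho$ and $a \sim A \sim t^{2/3}$, we keep track of
the lowest nonlinear terms to get

$$ \frac{\partial^2\delta_k}{\partial t^2} + 2\frac{\dot a}{a}\frac{\partial\delta_k}{\partial t} - 4\pi G\bar\rho\,\delta_k = \frac{28\pi G\bar\rho}{3}\int\frac{d^3k_1}{(2\pi)^3}\left[\frac{5}{7} + \frac{\mathbf{k}_1\cdot(\mathbf{k}-\mathbf{k}_1)}{|\mathbf{k}-\mathbf{k}_1|^2} + \frac{2}{7}\left(\frac{\mathbf{k}_1\cdot(\mathbf{k}-\mathbf{k}_1)}{k_1\,|\mathbf{k}-\mathbf{k}_1|}\right)^2\right]\delta_{k_1}\,\delta_{k-k_1} . \qquad (2.171) $$

Neglecting the right hand side of (2.171) gives $\delta_k^{(1)} \sim A(t)$, the transform of the linear
solution. Including the right hand side, and equating terms of second order, gives
$\delta_k^{(2)} \sim A^2(t)$, the transform of the $\mathbf{x}$-space second order solution (2.166), which we
write as

$$ \delta_k^{(2)} = \int\frac{d^3k_1}{(2\pi)^3}\frac{d^3k_2}{(2\pi)^3}\,G^{(2)}(\mathbf{k}_1,\mathbf{k}_2)\,\delta^{(1)}_{k_1}\,\delta^{(1)}_{k_2}\,[\delta_D] \qquad (2.172) $$

$$ G^{(2)}(\mathbf{k}_1,\mathbf{k}_2) = \frac{5}{7} + \frac{\mathbf{k}_1\cdot\mathbf{k}_2}{k_2^2} + \frac{2}{7}\,\frac{(\mathbf{k}_1\cdot\mathbf{k}_2)^2}{k_1^2\,k_2^2} \qquad (2.173) $$

$$ [\delta_D] = (2\pi)^3\,\delta_D\Big(\mathbf{k} - \sum_i \mathbf{k}_i\Big) . \qquad (2.174) $$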
The relations among $\delta$, $\mathbf{v}$, and $\Delta$ in $\mathbf{k}$-space are quite convenient, with differentiation
$\nabla_x$ bringing about factors of $\mathbf{k}$. But products in $\mathbf{x}$-space are of course transformed
into convolutions. The tradeoff works in favor of transform space, however, as shown
by Fry (1984) and Goroff et al. (1986) when third and higher order perturbations
are considered. This advantage in computational formalism is in addition to the
advantages of Fourier spectra as regards analysis of large-scale data.

The mode coupling kernel $G^{(2)}(\mathbf{k}_1,\mathbf{k}_2)$ is the first in a sequence which specifies the
perturbation series (2.164) in Fourier space. Even in nonsymmetrized form, the third
order kernel (Fry 1984) has a lengthy formula not reproduced here. The (nonsymmetrized)
fourth order kernel takes a full page to write down, and Goroff et al. (1986)
supplied this as well as a recursion for $G^{(n)}(\mathbf{k}_1, \ldots, \mathbf{k}_n)$ and an analogous kernel for the
velocity divergence (which specifies $\mathbf{v}$ since it is a gradient). Jain and Bertschinger
(1994) clarified the formalism and compared second order power spectrum calculations
(which go to third order in perturbation theory) to $N$-body simulations. Fry
(1994a) placed the higher order power spectrum calculations in compact form and
produced a calculation up to fifth order in perturbation theory for the case of a
top-hat power spectrum. This clarified how small $k$ and large $k$ power is generated from
an initial spectrum which is nonzero for only a finite range in $k$.


Using the second order kernel (2.173), we may calculate the bispectrum to lowest
nonvanishing order from

$$ \langle\delta_{k_1}\delta_{k_2}\delta_{k_3}\rangle = \langle\delta^{(1)}_{k_1}\,\delta^{(1)}_{k_2}\,\delta^{(2)}_{k_3}\rangle + [\text{2 cyclic terms}] . \qquad (2.175) $$

Recall that $\sum \mathbf{k}_j = 0$: the triangle must be closed on the $\mathbf{k}$-space grid. We find that

$$ B(\mathbf{k}_1,\mathbf{k}_2,\mathbf{k}_3) = Q_{12}P_1P_2 + Q_{23}P_2P_3 + Q_{31}P_3P_1 . \qquad (2.176) $$

Thus we have a perturbative result for $Q = B_{123}/(P_1P_2 + P_2P_3 + P_3P_1)$, and the
shape dependence of $Q$ provides a signature for gravitational instability. There are
two particularly simple, useful triangle configurations (they must be closed triangles,
by homogeneity). For equilateral triangles of side $k$ ($\hat{\mathbf{k}}_i\cdot\hat{\mathbf{k}}_j = -1/2$), we have $Q_{eq}(k)$ as
a function of scale. If we fix $k_1$ and $k_2$ ($k_1 = 2k_2$ proves to be a practical choice) and
let $\cos\theta = \hat{\mathbf{k}}_1\cdot\hat{\mathbf{k}}_2$, then $Q(\theta)$ is a useful parametrization based on shape, and it spans
scales from $|\mathbf{k}_1 - \mathbf{k}_2|$ down to $|\mathbf{k}_1 + \mathbf{k}_2|$. Fry (1994b) has used the $Q(\theta)$ statistic as
a useful method to isolate the effects of biasing from clustering and parameters such
as $\Omega_0$ and $\Lambda_0$.
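The shape dependence of $Q$ can be tabulated directly for power law spectra. The sketch below is a hedged illustration rather than the calculation used in this work: it assumes the standard symmetrized second order coupling $Q_{ij} = \frac{10}{7} + \hat{\mathbf{k}}_i\cdot\hat{\mathbf{k}}_j\left(\frac{k_i}{k_j} + \frac{k_j}{k_i}\right) + \frac{4}{7}(\hat{\mathbf{k}}_i\cdot\hat{\mathbf{k}}_j)^2$, which is not written out in the text above, and $P \sim k^n$. The function names are hypothetical.

```python
import math

def q_pair(ki, kj, mu):
    """Assumed symmetrized second order coupling for one pair of triangle sides."""
    return 10.0 / 7.0 + mu * (ki / kj + kj / ki) + (4.0 / 7.0) * mu * mu

def reduced_bispectrum(k1, k2, theta, n):
    """Q for the closed triangle k1 + k2 + k3 = 0, with cos(theta) = k1hat.k2hat."""
    mu12 = math.cos(theta)
    k3 = math.sqrt(k1 * k1 + k2 * k2 + 2.0 * k1 * k2 * mu12)  # |k1 + k2|
    mu23 = -(k2 + k1 * mu12) / k3   # k2hat.k3hat, using k3 = -(k1 + k2)
    mu31 = -(k1 + k2 * mu12) / k3   # k3hat.k1hat
    p1, p2, p3 = k1 ** n, k2 ** n, k3 ** n
    numerator = (q_pair(k1, k2, mu12) * p1 * p2
                 + q_pair(k2, k3, mu23) * p2 * p3
                 + q_pair(k3, k1, mu31) * p3 * p1)
    return numerator / (p1 * p2 + p2 * p3 + p3 * p1)

# Equilateral triangle: the angle between k1 and k2 is 120 degrees.
q_equilateral = reduced_bispectrum(1.0, 1.0, 2.0 * math.pi / 3.0, n=-1.0)

# Q(theta) for the k1 = 2 k2 configuration.
q_collinear = reduced_bispectrum(2.0, 1.0, 0.0, n=-1.0)
q_perpendicular = reduced_bispectrum(2.0, 1.0, math.pi / 2.0, n=-1.0)
print(q_equilateral, q_collinear, q_perpendicular)
```

Under these assumptions the equilateral value is $4/7$ independent of the spectral index, while $Q(\theta)$ for $k_1 = 2k_2$ is larger for collinear than for perpendicular configurations, which is the shape signature referred to in the text.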


CHAPTER  3 

ZEL’DOVICH  APPROXIMATION 

3.1 The Equations of Motion and the Approximation
3.1.1 Material Coordinate Formulation

The Zel'dovich approximation of gravitational instability takes place within a
Lagrangian, or material coordinate, formulation of the cosmological fluid equations
(Zel'dovich 1970). In the last chapter, we introduced the conventional Eulerian, or
spatial coordinate, approach to the equations and the resulting perturbation theory.
As in classical, earthly fluid mechanics, both approaches should be equivalent yet
complementary, each having its own advantages and disadvantages (Truesdell 1954).

We consider the paths of individual particles or fluid elements beginning at a
time $t_i$ (the decoupling era, for example) when the universe was highly uniform.
The comoving coordinates at time $t_i$ are taken as labels, or material coordinates,
which identify the particles throughout their trajectories. We denote these material
coordinates by $\mathbf{q}$. The comoving coordinates $\mathbf{x}$ following a trajectory are $\mathbf{x}(\mathbf{q},t)$.
Formally, for convenience, we take $t_i = 0$, so we have $\mathbf{x}(\mathbf{q},0) = \mathbf{q}$. The trajectories
in proper coordinates are given by $\mathbf{r}(\mathbf{q},t)$, which includes both the Hubble flow and
the peculiar motion. The fluid motion is viewed as the Lagrangian map $\mathbf{q} \to \mathbf{x}(\mathbf{q},t)$,
most conveniently studied in comoving coordinates $\mathbf{x}$ (as we have specified) rather
than in the $\mathbf{r}$ coordinates.

We write this $\mathbf{x}(\mathbf{q},t)$ mapping as

$$ \mathbf{x} = \mathbf{q} + \mathbf{\Psi}(\mathbf{q},t) , \qquad (3.1) $$

where the displacement $\mathbf{\Psi}(\mathbf{q},t)$ describes the gravitational instability, so $\mathbf{\Psi} = 0$
represents a pure Hubble flow.

In parallel with the development in Chapter 2, we may take an ideal fluid
(collisional) or a collisionless viewpoint. Taking a conventional ideal fluid continuum
approach, each fluid element will be described by its mass, position, and velocity.
Furthermore, each particle possesses thermodynamic properties, and the pressure
forces must be accounted for. We follow the collisionless approach, which again is
most appropriate for the era of structure formation. Therefore we consider a collisionless
gas of particles which interact by gravitation only. The problem is thus simplified
to gravitational instability only, as particles are endowed only with mass, position,
and velocity. The particles follow geodesics of the perturbed Robertson-Walker metric
(2.72). The equations of motion (Newtonian limit) are those we developed in
§2.2.3, and again we utilise the separation of the Hubble flow from the overall
trajectories, with the dynamics following from the Lagrangian

$$ L = \tfrac{1}{2}\,m\,a^2\,\dot{\mathbf{x}}^2 - m\,\phi(\mathbf{x},t) , \qquad (3.2) $$

where $m$ is a particle mass, $a(t)$ is the expansion factor, $\mathbf{x}$ is the comoving coordinate,
and $\phi$ is the peculiar gravitational potential.

The equations of motion in comoving coordinates are

$$ \frac{d\mathbf{x}}{dt} = \frac{\mathbf{v}}{a} , \qquad (3.3) $$

$$ \frac{d\mathbf{v}}{dt} + \frac{\dot a}{a}\,\mathbf{v} = -\frac{1}{a}\nabla_x\phi . \qquad (3.4) $$

The time derivative $d/dt$ is convective (evaluated at fixed $\mathbf{q}$).




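To this we must add the Poisson equation,

$$ \nabla_x^2\,\phi(\mathbf{x},t) = 4\pi G\bar\rho\,a^2\,\delta(\mathbf{x},t) . \qquad (3.5) $$

Expressed in terms of the peculiar acceleration $\mathbf{g}$ we have

$$ \nabla_x\cdot\mathbf{g} = -4\pi G\bar\rho\,a\,\delta , \qquad \nabla_x\times\mathbf{g} = 0 . \qquad (3.6) $$

Note that the Poisson equation requires the density contrast field in spatial coordinates.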

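In the Lagrangian formulation, continuity is assured by simply applying conservation
of mass to the trajectories, and before multistreaming occurs in the $\mathbf{x}(\mathbf{q},t)$
mapping, this is a straightforward procedure. We equate the mass in the fluid element
at $\mathbf{q}$, at time $t_i$, which is $\bar\rho(t_i)\,a^3(t_i)\,d^3q$, to the mass at $\mathbf{x}(\mathbf{q},t)$, which is $\rho(\mathbf{x},t)\,a^3(t)\,d^3x$,
and note that $\bar\rho a^3$ is constant in time. Therefore we write

$$ \rho(\mathbf{x},t)\,d^3x = \bar\rho(t)\,d^3q , \qquad (3.7) $$

noting that this applies exactly for single stream flow. The deformation of the volume
element is described by the Jacobian of the mapping,

$$ d^3x = \left\|\frac{\partial\mathbf{x}}{\partial\mathbf{q}}\right\| d^3q , \qquad (3.8) $$

where $\|\ldots\|$ denotes the determinant, and so we define the Jacobian $J(\mathbf{q},t)$ and a
deformation tensor $J_{ij}$ as

$$ J(\mathbf{q},t) = \left\|\frac{\partial\mathbf{x}}{\partial\mathbf{q}}\right\| , \qquad (3.9) $$

$$ J_{ij} = \frac{\partial x^i}{\partial q^j} . \qquad (3.10) $$

In terms of the density contrast $\delta$, defined by $\rho = \bar\rho\,(1+\delta)$, continuity is expressed as

$$ 1 + \delta = J^{-1} . \qquad (3.11) $$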



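The peculiar velocity $\mathbf{v}(\mathbf{x}(\mathbf{q},t),t)$ can be written as

$$ \mathbf{v} = a\,\frac{d\mathbf{x}}{dt} = a\,\frac{\partial\mathbf{\Psi}}{\partial t} , \qquad (3.12) $$

where the $d/dt$ represents the convective $(\partial/\partial t)|_q$. In general, the $\mathbf{q}$ and $\mathbf{x}$ derivatives
are related by

$$ \frac{\partial}{\partial x^i} = \frac{\partial q^j}{\partial x^i}\,\frac{\partial}{\partial q^j} , \qquad (3.13) $$

$$ \frac{\partial}{\partial q^i} = \frac{\partial x^j}{\partial q^i}\,\frac{\partial}{\partial x^j} . \qquad (3.14) $$

The Eulerian continuity equation is

$$ \frac{\partial\delta}{\partial t} + \frac{1}{a}\nabla\cdot(1+\delta)\mathbf{v} = 0 \qquad (3.15) $$

$$ \frac{d\delta}{dt} + \frac{1}{a}(1+\delta)\nabla\cdot\mathbf{v} = 0 , \qquad (3.16) $$

and we have rewritten (3.15) using $(\partial\delta/\partial t)_q$.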

One of the secrets to the success of the Lagrangian approach and its associated
perturbation theory is that the Eulerian continuity equation is satisfied automatically.
Note that $\delta$ does not appear as a dynamic variable in the equations of motion (3.3)
and (3.4), while it of course must appear in the Eulerian continuity equation. It is
easy to show how this works out. We compute the convective time derivative of $\delta$,

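$$ \frac{d\delta}{dt} = \frac{d}{dt}\left(J^{-1}\right) = -(1+\delta)\,\frac{1}{J}\frac{dJ}{dt} . \qquad (3.17) $$

Expanding the determinant and taking its derivative, we obtain

$$ J = \epsilon_{ijk}\,\frac{\partial x^1}{\partial q^i}\frac{\partial x^2}{\partial q^j}\frac{\partial x^3}{\partial q^k} \qquad (3.18) $$

$$ \frac{dJ}{dt} = \epsilon_{ijk}\left\{\frac{d}{dt}\frac{\partial x^1}{\partial q^i}\right\}\frac{\partial x^2}{\partial q^j}\frac{\partial x^3}{\partial q^k} + [\text{2 cyclic terms}] , \qquad (3.19) $$

where $\epsilon_{ijk}$ is the totally antisymmetric permutation tensor. The $\{\ldots\}$ term is

$$ \frac{d}{dt}\frac{\partial x^1}{\partial q^i} = \frac{\partial\dot{x}^1}{\partial q^i} = \frac{1}{a}\frac{\partial v^1}{\partial x^m}\frac{\partial x^m}{\partial q^i} , \qquad (3.20) $$

and so only the $m = 1$ term survives to give $a^{-1}(\partial v^1/\partial x^1)\,J$ as the first of the three
terms in (3.19). Therefore we have

$$ \frac{1}{J}\frac{dJ}{dt} = \frac{1}{a}\nabla_x\cdot\mathbf{v} , \qquad (3.21) $$

which establishes the consistency of the two approaches, comparing (3.16) with (3.17).
This all follows, more concisely, from the identity

$$ \frac{d}{dt}\log\|J_{ij}\| = J^{-1}_{ji}\,\frac{dJ_{ij}}{dt} . \qquad (3.22) $$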

The Eulerian velocity field due to gravitational instability is irrotational. Recall
that rotational modes for $\mathbf{v}$ do not couple to gravity, and from the Kelvin circulation
theorem in an expanding universe, such modes decay as $a^{-1}$. The displacement
mapping $\mathbf{\Psi}(\mathbf{q},t)$ could, in principle, encompass such transverse modes in general
and in perturbation theory (Buchert 1992). As appropriate to large-scale structure
formation, we have

$$ \nabla_x\times\mathbf{v} = 0 \qquad (3.23) $$

in the Eulerian picture. Using (3.13) and (3.12), this important irrotational condition
translates to

$$ \epsilon_{ijk}\,J^{-1}_{lj}\,\dot{J}_{kl} = 0 \qquad (3.24) $$

in material coordinates (with $\dot{J}_{kl} = dJ_{kl}/dt$), and this is valid during single stream flow
and, later on, outside of multistreaming regions which occur as pancakes, filaments, and
clumps are formed. Expressing the equation of motion following a particle, by taking the
divergence of (3.4) and utilizing (3.13), we have

$$ \nabla_x\cdot\left(\frac{\partial^2\mathbf{\Psi}}{\partial t^2} + 2\frac{\dot a}{a}\frac{\partial\mathbf{\Psi}}{\partial t}\right) = -4\pi G\bar\rho\,(J^{-1} - 1) , \qquad (3.25) $$

which, together with (3.24), provides the dynamics of the $\mathbf{\Psi}(\mathbf{q},t)$ map.

This  reduction  of  the  equations  of  motion  to  (3.24)  and  (3.25)  is  similar  to  what 
transpires  in  Eulerian  coordinates  when  deriving  the  Bernoulli  equation.  For  potential 


flow, we may integrate the Euler equation to obtain the Bernoulli equation in a
particularly simple form. Writing $\mathbf{v}$ as a gradient, $\mathbf{v} = -\nabla\phi_v/a$, the Euler equation

$$ \frac{\partial\mathbf{v}}{\partial t} + \frac{\dot a}{a}\,\mathbf{v} + \frac{1}{a}(\mathbf{v}\cdot\nabla)\mathbf{v} = -\frac{1}{a}\nabla\phi \qquad (3.26) $$

reduces to

$$ \frac{\partial\phi_v}{\partial t} - \frac{1}{2a^2}\left(\nabla\phi_v\right)^2 = \phi . \qquad (3.27) $$

Despite the complexity of the large-scale structure problem overall, there is an
underlying simplicity in our restricted problem which was recognized early on by the
famed Russian physicist Yakov Borisovich Zel'dovich (1914-1987). In 1970, Ya. B.
Zel'dovich published the article "Gravitational Instability: An Approximate Theory
for Large Density Perturbations" (Zel'dovich 1970). The paper is brief, intuitively
and physically motivated, and (as came to be realized some 10 to 15 years later)
important. Recently, it has loomed even more significant as the approximation
introduced therein has been found to be much more general than first suspected. The
matter of the universe may be primarily baryonic or nonbaryonic; the nonbaryonic
component may be "hot" or "cold". Regardless, the approximation has something to
say about structure on large scales.

The linear theory of perturbations, which we addressed in §2.2.4, is well established.
It can be thought of as the exact theory in the realm of small perturbations,
$\langle\delta^2\rangle \ll 1$. As we discussed in Chapter 2, when the perturbation series is continued to
second and third order (and beyond) the $N$-point correlation hierarchy may be calculated
(Peebles 1980, Fry 1984, Goroff et al. 1986) for $N = 3$, $N = 4$, etc. in hopes that
the observed galaxy correlations may retain a statistical imprint of the perturbation
theory results. Naively, one would expect the perturbation series results to have a
bearing only on the largest scales (where $\langle\delta^2\rangle$ is still small), but there is evidence that
this imprint is discernible even in the nonlinear regime. Nonetheless, perturbation
theory is generally limited by the constraint $\langle\delta^2\rangle < 1$. This is merely to suggest that
an extrapolation of perturbation theory may be in order. This is somewhat obvious.
However, when Zel'dovich made his suggestion to do so, there was no observational
evidence to speak of.

The basic idea of the approximation, remarkably simple in hindsight, is to
extrapolate linear gravitational theory into the nonlinear regime by following perturbations
of particle trajectories, rather than tracking perturbations in the density field.
As such, the Lagrangian formalism, which has some inherent advantages in analyzing
cosmological perturbations, was introduced by Zel'dovich. The approximation
involves separating the space and time dependence in the Lagrangian mapping, by
writing $\mathbf{\Psi}(\mathbf{q},t) = b(t)\,\mathbf{u}(\mathbf{q})$, and following the particle initially at $\mathbf{q}$ to its Eulerian
comoving coordinate $\mathbf{x}$, related to the Eulerian proper coordinates by
$\mathbf{r} = a\mathbf{x}$. Thus the Zel'dovich approximation has the following simple form,

$$ \mathbf{r}(\mathbf{q},t) = a(t)\left[\mathbf{q} + b(t)\,\mathbf{u}(\mathbf{q})\right] , \qquad (3.28) $$

where the growing function $b(t)$ characterizes the growth of the unstable gravitational
fluctuations. It turns out that $b(t)$ is just the growing mode of linear fluctuations $A(t)$,
though we keep the $b(t)$ notation of Shandarin and Zel'dovich (1989).

Linear perturbation theory in the Eulerian picture states that

$$ \delta(\mathbf{x},t) = A(t)\,\delta_0(\mathbf{x}) . \qquad (3.29) $$

The function $A(t)$ is the growing solution of (2.97), and the primal density contrast
$\delta_0(\mathbf{x})$ thus retains its shape in the initial stages of growth. The peculiar velocity field is

$$ \mathbf{v}(\mathbf{x},t) = -a\,\frac{\dot A}{A}\,\nabla\Delta , \qquad (3.30) $$

where $\Delta$, the rescaled first order peculiar gravitational potential,

$$ \Delta = \frac{\phi}{4\pi G\bar\rho\,a^2} , \qquad (3.31) $$

is a solution to the Poisson equation, which to this order reads

$$ \nabla^2\Delta = A(t)\,\delta_0(\mathbf{x}) . \qquad (3.32) $$

The peculiar acceleration $\mathbf{g}(\mathbf{x},t)$ is parallel to $\mathbf{v}(\mathbf{x},t)$ and, of course, $\nabla\times\mathbf{v} = 0$. For
the time dependence in linear theory, we may write $\delta \sim A$. The potential is nearly
time independent, $\phi \sim A/a$, and for $\Omega = 1$ we have $A \sim a$ and $\phi$ is constant in time.
The rescaled potential obeys $\Delta \sim A$ and the peculiar velocity follows $v \sim a\dot{A}$.

To extend (3.29) as it stands leads to problems. The initial power spectrum
$\langle|\delta_0(\mathbf{k})|^2\rangle$ evolves by a factor of $A^2(t)$ while retaining its shape, and we know this
result, while valid early on and useful for very large scales, is not reliable as a detailed
extrapolation, since mode coupling is ignored. Moreover, the following inconsistency
is most egregious: while $A(t)$ grows, (3.29) would say that in some regions $\delta$ might
have a value of, say, 1 or 2, while in other regions $\delta$ acquires a meaningless value less
than $-1$.

That a naive extension of linear theory goes astray is no surprise. During
linearisation, the $\nabla\cdot(\delta\mathbf{v})$ term from the continuity equation and the $(\mathbf{v}\cdot\nabla)\mathbf{v}$ term from
the momentum equation have been ignored.
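The inconsistency of the extrapolated linear density, and the way a trajectory-based extrapolation avoids it, can be illustrated in one dimension. The sketch below assumes a planar displacement $x = q + b(t)\,u(q)$ with $du/dq = -\delta_0(q)$, so that mass conservation in the form $1 + \delta = 1/J$ gives $J = 1 - b\,\delta_0$. The planar restriction, the function names, and the numerical values are illustrative assumptions.

```python
def linear_contrast(b, delta0):
    """Naive extrapolation of linear theory, delta = b * delta0."""
    return b * delta0

def zeldovich_contrast_1d(b, delta0):
    """Density contrast from 1 + delta = 1/J with J = 1 - b*delta0 (planar case)."""
    jacobian = 1.0 - b * delta0
    if jacobian <= 0.0:
        # The map q -> x has folded; single stream continuity no longer applies.
        raise ValueError("shell crossing: the mapping is no longer single stream")
    return 1.0 / jacobian - 1.0

# An initially underdense element followed to large growth factor.
b_late, delta0_void = 4.0, -0.5
linear_void = linear_contrast(b_late, delta0_void)
zeldovich_void = zeldovich_contrast_1d(b_late, delta0_void)

# Early times: the two descriptions agree to first order in b.
b_early, delta0_peak = 0.01, 0.5
early_difference = abs(zeldovich_contrast_1d(b_early, delta0_peak)
                       - linear_contrast(b_early, delta0_peak))
print(linear_void, zeldovich_void, early_difference)
```

In the underdense element the extrapolated linear contrast falls below $-1$, while the value obtained from the mapping stays above $-1$ because the element simply expands. Overdense elements instead reach $J = 0$ at finite $b$, which is where this single stream sketch stops.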

In a material coordinate approach, an approximation is formed once a particular
model for the $\mathbf{\Psi}(\mathbf{q},t)$ mapping is made. Naturally, agreement with (3.29) must be
ensured for small displacements. Once we have adopted a form for $\mathbf{\Psi}$, it follows that
we have adopted an equation of motion for each particle. The peculiar velocity, as a
function of $\mathbf{q}$ and $t$, is given by

$$ \mathbf{v} = a\,\frac{\partial\mathbf{\Psi}}{\partial t} , \qquad (3.33) $$

and the peculiar acceleration is

$$ \mathbf{g} = a\,\frac{\partial^2\mathbf{\Psi}}{\partial t^2} + 2\dot{a}\,\frac{\partial\mathbf{\Psi}}{\partial t} . \qquad (3.34) $$

The equation of continuity (3.11) will be satisfied exactly. The distributions of density
and velocity in space are calculable once $\mathbf{\Psi}(\mathbf{q},t)$ is specified. That is, the mapping
$\mathbf{x}(\mathbf{q},t)$ will describe the entire picture of the motion. The Poisson equation relating
$\nabla\cdot\mathbf{g}$ and $\delta$ will not be satisfied (hence the phrase "approximation"). However, for
small displacements the required agreement with linear perturbation theory will ensure
that the Poisson equation will be satisfied to first order. Will an extrapolation beyond
the linear regime work? The arguments presented above do not necessarily indicate
that it would. On the other hand, that Zel'dovich was able to find a successful model
should not come as a complete surprise.

Using the Zel'dovich mapping,

$$ \mathbf{x}(\mathbf{q},t) = \mathbf{q} + b(t)\,\mathbf{u}(\mathbf{q}) , \qquad (3.35) $$

the peculiar velocity as a function of $\mathbf{q}$ and $t$ is

$$ \mathbf{v} = a\,\dot{b}\,\mathbf{u} , \qquad (3.36) $$

and for small displacements we require this to agree with the velocity of linear
perturbation theory. Comparing the time dependence in equations (3.30) and (3.36), we
see that $b(t)$ is the growing mode (e.g. $b \sim t^{2/3}$ for $\Omega = 1$) solution of

$$ \ddot{b} + 2\frac{\dot a}{a}\,\dot{b} - 4\pi G\bar\rho\,b = 0 . \qquad (3.37) $$
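For $\Omega = 1$ the growth equation can be solved by a power law, which makes the growing mode easy to confirm. The sketch below assumes $a \sim t^{2/3}$, so that $\dot a/a = 2/(3t)$, and uses $6\pi G\bar\rho\,t^2 = 1$, so that $4\pi G\bar\rho = 2/(3t^2)$. Substituting $b = t^p$ into $\ddot b + 2(\dot a/a)\dot b - 4\pi G\bar\rho\,b = 0$ then gives the quadratic $p(p-1) + \frac{4}{3}p - \frac{2}{3} = 0$. The function names are illustrative.

```python
import math

def growth_exponents():
    """Roots p of p(p - 1) + (4/3)p - 2/3 = 0, from substituting b = t^p."""
    lin, const = 1.0 / 3.0, -2.0 / 3.0       # p^2 + lin*p + const = 0
    disc = math.sqrt(lin * lin - 4.0 * const)
    return (-lin + disc) / 2.0, (-lin - disc) / 2.0

def residual(p, t):
    """Left hand side of the growth equation for b = t^p in an Omega = 1 model."""
    b = t ** p
    b_dot = p * t ** (p - 1.0)
    b_ddot = p * (p - 1.0) * t ** (p - 2.0)
    hubble = 2.0 / (3.0 * t)                 # adot/a for a ~ t^(2/3)
    four_pi_g_rho = 2.0 / (3.0 * t * t)      # from 6 pi G rho t^2 = 1
    return b_ddot + 2.0 * hubble * b_dot - four_pi_g_rho * b

growing, decaying = growth_exponents()
print(growing, decaying, residual(growing, 2.0), residual(decaying, 2.0))
```

The two roots are the growing mode $b \sim t^{2/3}$ and a decaying mode $b \sim t^{-1}$; only the first is kept in the approximation.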

Comparing equations (3.30) and (3.36) for small displacements, we may write

$$ \mathbf{u}(\mathbf{q}) = -\nabla_q\Phi(\mathbf{q}) , \qquad (3.38) $$

where $\Phi$ is independent of time and is proportional to the linear peculiar gravitational
potential. Specifically, we have (note $\Delta \sim b$ and $\phi \sim b/a$),

$$ \Phi(\mathbf{q}) = \frac{\Delta}{b} = \frac{\phi}{4\pi G\bar\rho\,a^2\,b} . \qquad (3.39) $$

If we factor out the time dependence in $\phi$ by writing $\phi = (b/a)\,\phi_0$, so that $\phi_0(\mathbf{q}) =
\phi(\mathbf{x},t_i)$ is the initial potential, then the relation of the Lagrangian mapping, with
$\mathbf{u} = -\nabla_q\Phi$, to the gravitational potential is particularly simple,

$$ \mathbf{x} = \mathbf{q} - \frac{b(t)}{4\pi G\bar\rho\,a^3}\,\nabla_q\phi_0(\mathbf{q}) . \qquad (3.40) $$

Since the velocity is a gradient in $\mathbf{q}$ space, it is curl-free in material coordinates. It is
more important from a physical standpoint to note that the spatial velocity field (in
$\mathbf{x}$ coordinates) is irrotational, which we show below.

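The deformation of a material element is described by the tensor $\partial\mathbf{x}/\partial\mathbf{q}$, which by equation (3.35) is

$$J_{ij} = \frac{\partial x^i}{\partial q^j} = \delta_{ij} + b\,\frac{\partial u^i}{\partial q^j}, \tag{3.41}$$

and is symmetric, since $\mathbf{u}$ is a gradient. The Jacobian determinant $J$ is of fundamental importance since, by equation (3.11), it directly determines the density in material coordinates. We may calculate it by expanding the determinant as an antisymmetric product,

$$J = \frac{1}{3!}\,\epsilon_{ijk}\,\epsilon_{lmn}\,J_{il}J_{jm}J_{kn}, \tag{3.42}$$

which we can write in powers of $b$ after using the following $\epsilon_{ijk}$ properties:

$$\epsilon_{ijk}\epsilon_{ijk} = 3!\,, \qquad \epsilon_{ijk}\epsilon_{ljk} = 2\delta_{il}\,, \qquad \epsilon_{ijk}\epsilon_{lmk} = \delta_{il}\delta_{jm} - \delta_{im}\delta_{jl}. \tag{3.43}$$

Thus, the determinant as a function of $\mathbf{q}$ and $t$ is

$$J = 1 + b\,\frac{\partial u^i}{\partial q^i} + \frac{b^2}{2}\left(\frac{\partial u^i}{\partial q^i}\frac{\partial u^j}{\partial q^j} - \frac{\partial u^i}{\partial q^j}\frac{\partial u^j}{\partial q^i}\right) + b^3\left\|\frac{\partial \mathbf{u}}{\partial \mathbf{q}}\right\|. \tag{3.44}$$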

We can explicitly check that the density obtained from equation (3.11) agrees to first order with linear perturbation theory results. The material coordinate density contrast in the Zel'dovich approximation is

$$\delta = J^{-1} - 1 = -b(t)\,\frac{\partial u^i}{\partial q^i} + O(b^2). \tag{3.45}$$

This agrees with equation (3.29) since $\delta$ separates into a time dependent factor $b(t)$ and a coordinate factor giving the primordial density field,

$$\delta_0(\mathbf{q}) = -\nabla_q\cdot\mathbf{u}. \tag{3.46}$$

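The above equation is in material coordinates, but it conveys the same information, to lowest order, as the linearized spatial coordinate density-velocity relation (from continuity), with $\delta = b(t)\,\delta_0(\mathbf{x})$,

$$\frac{\partial\delta}{\partial t} = -\frac{1}{a}\,\nabla_x\cdot\mathbf{v}. \tag{3.47}$$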


We  have  shown  that  the  mapping  (3.35)  is  consistent  with  the  velocity  and  density 
of  linear  perturbation  theory. 

To quote the master Zel'dovich himself, "the approximation proposed in this article consists in the extrapolation of formula [(3.28)] into the region where the perturbations in density [$\delta$] are not small" (Zel'dovich 1970, p. 84). This paper is reprinted in his Selected Works, Volume II: Particles, Nuclei, and the Universe (Zel'dovich 1993), which spans his pre-World War II uranium fission work through the cosmology papers written in his last few decades.

The Zel'dovich approximation is a kinematic model which retains important features of the dynamics. The asymmetric nature of gravitational instability (pancake formation) is predicted in a cosmological context. The Eulerian velocity field remains curl-free. The peaks in the linear density field act as attractors. But the nature of the material coordinate deformations mimics the tidal interaction of neighboring masses, and this affects the shape (primarily flattened) and connectivity (cellular structure) of the structures formed. The nature of perturbation theory in material coordinates allows for a simple scheme like the Zel'dovich mapping to produce a second order Eulerian perturbation

$$\delta^{(2)} = \frac{1}{2}\left(\delta^{(1)}\right)^2 + b\,\Phi_{,i}\,\delta^{(1)}_{,i} + \frac{b^2}{2}\,\Phi_{,ij}\Phi_{,ij}\,, \tag{3.48}$$

which we show below. The nonlocal effects are nearly those of the exact theory: compare (3.48) with the exact second order expression (2.166).

3.2  Pancakes  and  Other  Features 

The first and most important consequence of the Zel'dovich mapping is the formation of pancakes: relatively thin sheet-like structures of high density. Due to the continuity and smoothness of the mapping, the pancakes (also known as caustics) tend to be somewhat connected and thus form a cellular structure. The approximation's pancaking property may be thought of as a virtue because of the accumulating observational evidence that the large-scale matter distribution contains bubbles, walls, and voids. The Poisson equation is not grossly violated, even when the approximation has been extended to the point of pancake formation, when $\delta$ is significantly greater than 1 in the dense regions. The proposed $\mathbf{x}(\mathbf{q},t)$ mapping does a good job of representing gravitational instability in an expanding universe. In fact, the approximate solution shares several properties, both quantitative and qualitative, with the exact solution. These points become clearer as we present some of the details below.

Let us define the material coordinate deformation tensor as

$$d_{ij} = \frac{\partial u^i}{\partial q^j} = -\frac{\partial^2\Phi}{\partial q^i\,\partial q^j}, \tag{3.49}$$

which is symmetric by construction. Consider a cubical material element at initial location $\mathbf{q}$ (i.e. consider a group of particles near particle $\mathbf{q}$), occupying volume $d^3q$, with the cube specified by the three perpendicular lengths $dq^i$. In general, the material element undergoes deformation, and the cube bounded by $dq^i$ transforms into a parallelepiped bounded by

$$dx^i = \frac{\partial x^i}{\partial q^j}\,dq^j = \left(\delta_{ij} + b\,d_{ij}\right)dq^j, \tag{3.50}$$

occupying volume $\|\partial\mathbf{x}/\partial\mathbf{q}\|\,d^3q$. At each point $\mathbf{q}$, the deformation tensor may be diagonalized, producing three real eigenvalues corresponding to three mutually perpendicular eigenvectors. We denote the three eigenvalues of $d_{ij}$ by $-\lambda_i$. The negative sign is added by convention, following Shandarin and Zel'dovich (1989), so the $\lambda_i$ are eigenvalues of $\Phi_{ij} = \partial^2\Phi/\partial q^i\partial q^j$, which describes the curvature of the initial linear gravitational potential. Without loss of generality we order the eigenvalues by

$$\lambda_1 \ge \lambda_2 \ge \lambda_3. \tag{3.51}$$

In coordinates which diagonalize $d_{ij}$ at $\mathbf{q}$, the $q^i$ define the principal axes, and the deformation is described by the matrix

$$J_{ij} = \mathrm{diag}\left(1 - b\lambda_1,\; 1 - b\lambda_2,\; 1 - b\lambda_3\right). \tag{3.52}$$

If the initial cube is oriented so that its sides $dq^i$ coincide with the new coordinate axes (the eigenvectors of $d_{ij}$), then the deformation is particularly simple to describe: the cube suffers a pure compression or expansion in each of the $q^i$ directions. Thus the cube transforms into a rectangular parallelepiped.

The tensor $\Phi_{ij}$ possesses three characteristic invariants:

$$I_1 = \lambda_1 + \lambda_2 + \lambda_3, \qquad I_2 = \lambda_1\lambda_2 + \lambda_2\lambda_3 + \lambda_3\lambda_1, \qquad I_3 = \lambda_1\lambda_2\lambda_3. \tag{3.53}$$

These invariants are indeed coordinate independent, which is shown by the fact that they may be written in terms of tensor contractions,

$$I_1 = \Phi_{ii}, \qquad I_2 = \frac{1}{2}\left(\Phi_{ii}\Phi_{jj} - \Phi_{ij}\Phi_{ij}\right) = \frac{1}{2}I_1^2 - \frac{1}{2}\Phi_{ij}\Phi_{ij},$$

$$I_3 = \|\Phi_{ij}\| = \frac{1}{6}\Phi_{ii}\Phi_{jj}\Phi_{kk} - \frac{1}{2}\Phi_{ii}\Phi_{jk}\Phi_{jk} + \frac{1}{3}\Phi_{ij}\Phi_{jk}\Phi_{ki} = I_1 I_2 - \frac{1}{3}I_1^3 + \frac{1}{3}\Phi_{ij}\Phi_{jk}\Phi_{ki}. \tag{3.54}$$

The Jacobian is easily calculated from equation (3.52) to be

$$J = (1 - b\lambda_1)(1 - b\lambda_2)(1 - b\lambda_3) = 1 - bI_1 + b^2 I_2 - b^3 I_3. \tag{3.55}$$

Therefore, the material coordinate density can be written in the particularly revealing form

$$1 + \delta(\mathbf{q},t) = \frac{1}{(1 - b\lambda_1)(1 - b\lambda_2)(1 - b\lambda_3)}, \tag{3.56}$$

while trajectories do not cross. In the event of multistream flow, there are modifications (Shandarin & Zel'dovich 1989, Kofman, Bertschinger, Gelb, Nusser, & Dekel 1994), following from the fact that a sum over the various $\mathbf{q}$ values whose trajectories arrived at $\mathbf{x}$ would need to be included on the right hand side of equation (3.7).
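The relations among the eigenvalues, the invariants, and the Jacobian can be checked numerically. The sketch below is illustrative only: it uses a random symmetric matrix as a stand-in for $\Phi_{ij}$ at one point $\mathbf{q}$, and the variable names and the value $b = 0.1$ are our assumptions, not quantities taken from the simulations.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.normal(size=(3, 3))
phi = 0.5 * (A + A.T)            # hypothetical symmetric tensor Phi_ij
lam = np.linalg.eigvalsh(phi)    # eigenvalues lambda_i of Phi_ij

# Invariants from the eigenvalues.
I1 = lam.sum()
I2 = lam[0] * lam[1] + lam[1] * lam[2] + lam[2] * lam[0]
I3 = lam.prod()

# The same invariants from tensor contractions (traces of powers of Phi).
tr1 = np.trace(phi)
tr2 = np.trace(phi @ phi)
tr3 = np.trace(phi @ phi @ phi)
I2_contr = 0.5 * (tr1**2 - tr2)
I3_contr = tr1**3 / 6.0 - tr1 * tr2 / 2.0 + tr3 / 3.0

# Jacobian of x = q + b u with du^i/dq^j = -Phi_ij.
b = 0.1
J_det = np.linalg.det(np.eye(3) - b * phi)
J_poly = 1.0 - b * I1 + b**2 * I2 - b**3 * I3
one_plus_delta = 1.0 / J_det     # material coordinate density, single stream

print(J_det, J_poly)
```

Because the invariants are traces and a determinant, the two routes agree in any coordinate frame, which is the content of the coordinate independence statement.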


Equation (3.56) is important, as it shows that collapse to infinite density is inevitable in regions where there is at least one positive eigenvalue $\lambda_i$. If, as is almost always the case, we have unequal eigenvalues, labeled $\lambda_1 > \lambda_2 > \lambda_3$, then if $\lambda_1 > 0$ the collapse is one dimensional in that deformation is primarily along the $\lambda_1$ eigenvector direction, giving a $\delta \to \infty$ caustic with the eigenvector as its normal. After the pancake collapse, there follows, if $\lambda_2 > 0$, compression within the sheet which tends to form filamentary structure. Finally, if $\lambda_3$ is also positive, collapse along filaments produces clusters. These features are qualitatively borne out in complete N-body simulations of structure formation in an expanding universe (Kofman, Pogosyan, Shandarin, & Melott 1992). As an illustration, we produce in Figure 3-1 a somewhat generic numerical realization of the Zel'dovich approximation in three dimensions. We have plotted the mass distribution in a thin slice.

A major problem of the approximation stems from the fact that it is ballistic, and so the pancakes dissolve: the smaller scale gravitational interactions are effectively turned off and particles do not turn around and fall into the potential wells. The blessing given to the approximation, however, is directly related to addressing this problem. Gravitational instability within an expanding spacetime can be successfully analyzed with spatial filters, thus washing out the small scale problems of the Zel'dovich approximation (Kofman et al. 1992). The weaknesses of the approximation are thus somewhat obvious, yet in comparing various approximation schemes of gravitational instability (some of which we mention below) with N-body results, the Zel'dovich approximation fares surprisingly well (Coles, Melott, & Shandarin 1993). The cross correlations between Zel'dovich and N-body simulated data are quite encouraging, provided that the initial fluctuation spectrum in the Zel'dovich simulations has been filtered. The surprise is that evolution of the smaller scale substructure does not skew the large-scale results in such a way that successful filtering breaks down on all but the largest (linear regime) scales. As a result, the formal title "Truncated Zel'dovich Approximation" is sometimes used in the literature to reflect the successful usage of filtering (the "truncation" refers to large $k$ suppression of the $\langle|\delta_0(\mathbf{k})|^2\rangle$ spectrum).

The baryonic pancake and neutrino HDM models have faded. These models utilized the Zel'dovich approximation in a top-down sense to generate the large-scale cellular structure (cf. Doroshkevich, Ryaben'kii, & Shandarin 1973, Shandarin, Doroshkevich, & Zel'dovich 1983). A short review which indicates the general philosophy of such models is contained in a collection of review articles by Zel'dovich (1992). A bottom-up, hierarchical paradigm was developed by Peebles and others during this same period (cf. Peebles 1974a, Peebles 1980). The hierarchical models had closer ties with the $\approx 1$ to $10\ h^{-1}$ Mpc scale correlation data. The Press-Schechter (1974) algorithm for obtaining a bottom-up mass spectrum of collapsed objects based on the linear density field is a practical hierarchical example. With closer collaboration between the two schools of thought during the 1980s, coupled with the rapid improvements in computer simulations, there has been a merger of sorts. Qualitatively, the bottom-up features of the hierarchical, CDM models have survived. Utilizing filtering in Lagrangian mapping approximation schemes appears to give semi-quantitative insight into the formation of large coherent structures, which can be called super-pancakes and super-filaments (Kofman et al. 1992). Such structures appear in the simulation data as well as in redshift surveys.

Figure 3-1. A two dimensional slice from a three dimensional numerical realization of the Zel'dovich approximation.

We now turn our attention to some of the features of the Zel'dovich approximation. Once we have the Zel'dovich map, it is easy to check that it obeys the Eulerian $\nabla\times\mathbf{v} = 0$ condition. The curl-free condition (3.24) holds identically, which we observe by using the frame where $J_{ij}$ is diagonal, but note that the condition becomes indeterminate (we divide by zero) once caustic formation occurs; i.e. we need the quantity $1 - b(t)\,\lambda_1(\mathbf{q})$ to be positive at the point $\mathbf{x}(\mathbf{q},t)$ where the curl is taken.

The Zel'dovich approximate solution is actually the exact solution in one dimension, prior to multistreaming. That is, in the artificial case in which perturbations depend on one coordinate, $\delta(x,y,z) \equiv \delta(x)$, corresponding to a set-up for pancake formation parallel to the $yz$-plane, the exact dynamics in an expanding background gives the Zel'dovich mapping. This is easy to check. The one-dimensional mapping gives

$$\delta = J^{-1} - 1 = -\frac{b\,\partial u/\partial q}{1 + b\,\partial u/\partial q}. \tag{3.57}$$

The peculiar acceleration in one, two, or three dimensions is

$$\mathbf{g} = \frac{d\mathbf{v}}{dt} + \frac{\dot a}{a}\,\mathbf{v} = \left(2\dot a\dot b + a\ddot b\right)\mathbf{u} = 4\pi G\bar\rho\,a b\,\mathbf{u}. \tag{3.58}$$

In one dimension, we may simply compute the divergence of $\mathbf{g}$ to find

$$\nabla\cdot\mathbf{g} = \frac{\partial g}{\partial q}\,\frac{\partial q}{\partial x} = -4\pi G\bar\rho\,a\,\delta, \tag{3.59}$$

with $\delta$ given by (3.57), so we see that the Poisson equation is satisfied exactly.

In two or three dimensions, the Poisson equation is only met to first order. We may keep track of the fractional error in the Poisson equation in the following manner (Shandarin & Zel'dovich 1989). Let us define $\delta_*$ to be the dynamical density calculated from the divergence of $\mathbf{g}$ in the Zel'dovich approximation,

$$\delta_* = -\frac{1}{4\pi G\bar\rho\,a}\,\nabla\cdot\mathbf{g}, \tag{3.60}$$

and we define the fractional error as

$$\mathrm{error} = \frac{(1 + \delta_*) - (1 + \delta)}{1 + \delta}, \tag{3.61}$$

where $\delta$ is $J^{-1} - 1$ from the material continuity equation. For the divergence of $\mathbf{g}$ we write, using (3.58),

$$-\frac{1}{4\pi G\bar\rho\,a}\,\nabla\cdot\mathbf{g} = -b\,\frac{\partial u^i}{\partial x^i} = -b\,\frac{\partial u^i}{\partial q^j}\frac{\partial q^j}{\partial x^i} = b\left[\frac{\lambda_1}{1 - b\lambda_1} + \frac{\lambda_2}{1 - b\lambda_2} + \frac{\lambda_3}{1 - b\lambda_3}\right], \tag{3.62}$$

where we have diagonalized the deformation tensor. Using expression (3.56) for $\delta$, we find

$$\mathrm{error} = -b^2 I_2 + 2b^3 I_3, \tag{3.63}$$

which grows with time but which remains finite even at caustic formation, since $\delta$ and $\delta_*$ diverge in the same manner. Equation (3.63) shows that the Poisson equation is satisfied to first order. Also, in the one-dimensional case, both $\lambda_2$ and $\lambda_3$ vanish, so that $I_2 = I_3 = 0$ and there is no error, as we showed above.
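The closed form (3.63) can be confirmed directly from the definitions. The following sketch compares the fractional error computed from $1 + \delta$ and $1 + \delta_*$ with $-b^2 I_2 + 2b^3 I_3$; the function names and the sample eigenvalues are hypothetical, chosen only so that $1 - b\lambda_i > 0$ (single-stream flow).

```python
import numpy as np

def poisson_error(lam, b):
    """Fractional error [(1 + delta_*) - (1 + delta)] / (1 + delta)."""
    lam = np.asarray(lam, dtype=float)
    one_plus_delta = 1.0 / np.prod(1.0 - b * lam)                  # from J^-1
    one_plus_delta_star = 1.0 + b * np.sum(lam / (1.0 - b * lam))  # from div g
    return (one_plus_delta_star - one_plus_delta) / one_plus_delta

def poisson_error_closed_form(lam, b):
    """The same error written as -b^2 I_2 + 2 b^3 I_3."""
    l1, l2, l3 = lam
    I2 = l1 * l2 + l2 * l3 + l3 * l1
    I3 = l1 * l2 * l3
    return -b**2 * I2 + 2.0 * b**3 * I3

lam_3d = (0.9, 0.2, -0.5)   # illustrative eigenvalues, lambda_1 > lambda_2 > lambda_3
lam_1d = (0.9, 0.0, 0.0)    # one-dimensional perturbation
b = 0.7

print(poisson_error(lam_3d, b), poisson_error_closed_form(lam_3d, b))
print(poisson_error(lam_1d, b))   # vanishes up to round-off in one dimension
```

The one-dimensional case returns zero up to round-off, in line with the exactness of the mapping before shell crossing.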

When $\delta \to \infty$ in the approximation, we still have a finite mass per unit area in the sheet of collapsed matter, since the perturbation deforms one-dimensionally, and the gravitational potential is finite and smooth, and the error remains finite even at collapse. The finite nature of the singularity at a caustic may be seen from generic considerations of smooth Lagrangian maps. Consider a pancake forming at $\mathbf{x}_0 = \mathbf{x}(\mathbf{q}_0, t_0)$. We diagonalize the deformation tensor $J_{ij}$ at $\mathbf{q}_0$ as in equation (3.52). A sheet is forming at the time given by $1 - b(t_0)\,\lambda_1 = 0$. We reduce to the one dimensional problem for simplicity, with no essential loss of generality (Shandarin & Zel'dovich 1989), and we let $x$ be the $\lambda_1$ eigenvector direction. For material coordinates $q = q_0 + \Delta q$ and Eulerian coordinates $x = x_0 + \Delta x$ near the sheet, we have

$$\Delta x = \frac{\partial x}{\partial q}\,\Delta q + \frac{1}{2}\frac{\partial^2 x}{\partial q^2}\,\Delta q^2 + \cdots. \tag{3.64}$$

If $\lambda_1$ is a relative maximum, then in the single-stream flow at $t_0$ near $x_0$, we have $J(q) \sim \Delta q^2$ and $\Delta x \sim \Delta q^3$, which gives

$$1 + \delta(x) \sim |x - x_0|^{-2/3} \tag{3.65}$$

as the density profile near the forming caustic. Slightly after $t_0$, we have triple-stream flow. The $u(q)$ function is smooth, and it remains smooth with respect to the Eulerian coordinates $x$, but $\partial u/\partial x$ goes to $-\infty$ at $x_0$, immediately after which the pancake at $x_0$ corresponds to two singularities, near $x_0$, between which there is triple-stream flow. In that triple-stream case (at $t_0 + \Delta t$), and in the more generic case where $\delta$ is blowing up but we do not have a relative maximum of $\lambda_1$ at $q_0$, there is also a manageable singularity. We have $J(q) \sim \Delta q$ and $\Delta x \sim \Delta q^2$, which gives

$$1 + \delta(x) \sim |x - x_0|^{-1/2} \tag{3.66}$$

as the generic density profile near a singularity at $x_0$. These cusps may be integrated over to obtain a finite mass per unit area in a pancake. The general description of the singularities of the Zel'dovich mapping in one and two dimensions was analyzed by Arnold, Shandarin, and Zel'dovich (1982) and placed within the context of catastrophe theory by Arnold (1992).

Figure 3-2. The growth of density fluctuations with time: the formation of pancakes in a one dimensional simulation. The linear evolution, $\langle\delta^2\rangle \sim b^2$, gives $\langle\delta^2\rangle^{1/2}$ values of 0.1, 1, and 1.5; the actual values from the simulation, noted on the diagram, are larger due to nonlinear evolution.
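The two cusp exponents can be illustrated with a toy one-dimensional map. The sketch below assumes the hypothetical displacement field $u(q) = -\sin q$ (our choice, not the simulated field), for which the first caustic forms at $q_0 = 0$ when $b = 1$, and fold caustics exist at $\cos q_c = 1/b$ for $b > 1$. It fits the logarithmic slope of $1 + \delta = 1/|J|$ against $|x - x_0|$.

```python
import numpy as np

def profile_slope(b, q_c, dq):
    """Log-log slope of 1/|J| versus |x - x_c| near q_c for x = q - b sin(q)."""
    q = q_c + dq
    x = q - b * np.sin(q)
    x_c = q_c - b * np.sin(q_c)
    jac = 1.0 - b * np.cos(q)           # J = dx/dq
    slope, _ = np.polyfit(np.log(np.abs(x - x_c)),
                          np.log(1.0 / np.abs(jac)), 1)
    return slope

# First caustic: lambda_1 = cos(q) has a relative maximum at q = 0, b = 1.
slope_first = profile_slope(1.0, 0.0, np.logspace(-3.0, -1.5, 40))

# Generic fold slightly later (b = 1.5): J vanishes linearly in dq.
q_fold = np.arccos(1.0 / 1.5)
slope_generic = profile_slope(1.5, q_fold, np.logspace(-4.0, -2.5, 40))

print(slope_first, slope_generic)       # close to -2/3 and -1/2
```

Both exponents are greater than $-1$, so the profiles integrate to a finite mass across the sheet.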

We have performed one-dimensional numerical simulations which clarify the above remarks. In Figure 3-2, we have evolved some rather generic initial conditions to display caustic formation. The initial $\langle|\delta_0(k)|^2\rangle$ spectrum was simply flat, with a cutoff at large $k$. The top density profile is in the linear regime, with $\langle\delta^2\rangle^{1/2} = 0.1$, and the bottom profiles have $\langle\delta^2\rangle > 1$, as shown. The single- and triple-stream regions near caustics are evident in this picture. In Figures 3-3 and 3-4, we have zoomed in on a pancake from another generic realization of the approximation. The Eulerian velocity field $v/a\dot b$ is shown to be single-valued in Figure 3-3, with $\partial v/\partial x \to -\infty$ at the cusp. In Figure 3-4, the velocity is triple-valued in the region of the thickening pancake. Outside of multistream regions, the approximation still corresponds to the exact solution of one-dimensional instability in an expanding universe.
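A one-dimensional realization of this kind can be sketched in a few lines. The code below is not the program used for the figures; it is a minimal illustration that assumes a flat spectrum with a sharp cutoff at integer wavenumber `k_cut`, exponentially distributed mode powers, and uniform random phases. All names (`zeldovich_1d`, `is_single_stream`, `b_cross`) are ours.

```python
import numpy as np

def zeldovich_1d(n=512, k_cut=8, seed=1):
    """Return q, u(q), delta_0(q) on a periodic grid of n points (grid units)."""
    rng = np.random.default_rng(seed)
    q = np.arange(n, dtype=float)
    k_int = np.arange(n // 2 + 1)                 # integer wavenumbers 0..n/2
    k = 2.0 * np.pi * k_int / n                   # radians per grid unit
    modes = (k_int >= 1) & (k_int <= k_cut)       # flat spectrum with cutoff
    amp = np.sqrt(-np.log(1.0 - rng.uniform(size=modes.sum())))
    phase = 2.0 * np.pi * rng.uniform(size=modes.sum())
    delta_k = np.zeros(k.size, dtype=complex)
    delta_k[modes] = amp * np.exp(1j * phase)
    u_k = np.zeros(k.size, dtype=complex)
    u_k[modes] = 1j * delta_k[modes] / k[modes]   # so that du/dq = -delta_0
    return q, np.fft.irfft(u_k, n), np.fft.irfft(delta_k, n)

def is_single_stream(q, u, b):
    """True if x = q + b u is monotonic in q, i.e. no shell crossing yet."""
    x = q + b * u
    return bool(np.all(np.diff(x) > 0.0))

q, u, delta0 = zeldovich_1d()
b_cross = 1.0 / delta0.max()      # first caustic: 1 + b du/dq = 0 at max delta_0
print(is_single_stream(q, u, 0.5 * b_cross))   # True
print(is_single_stream(q, u, 1.5 * b_cross))   # False
```

Before `b_cross` the density follows from $1 + \delta = 1/J$ alone; afterwards the contributions of the separate streams arriving at each $x$ would have to be summed.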

We have also performed two-dimensional simulations, evolving from generic initial conditions (again, flat $\langle|\delta_0(\mathbf{k})|^2\rangle$ with high $k$ cutoff). In Figure 3-5 we show a typical example, evolved until $\langle\delta^2\rangle \approx 1$. We show a close-up view of the thickening of several pancakes in Figure 3-6.

The statistics and form of the initial gravitational potential clearly provide the seeds for later structure formation. In view of this, Doroshkevich (1970) wrote a seminal paper concerned with the spatial structure of an initial random field. This was a precursor to the famous BBKS (Bardeen, Bond, Kaiser, & Szalay 1986) paper


Figure 3-3. Generic kinematics of the single stream Eulerian velocity field $v/a\dot b$ perpendicular to a pancake, from a one-dimensional simulation.

Figure 3-4. The pancake thickens. Generic kinematics of the triple stream Eulerian velocity field $v/a\dot b$ perpendicular to a pancake, from a one-dimensional simulation.

Figure 3-5. A typical two-dimensional simulation. A somewhat connected network of pancakes is formed.

Figure 3-6. A closer view of smooth collapse into pancakes. The top picture, which has single-stream flow at the caustics, was evolved until $\langle\delta^2\rangle^{1/2} \approx 1$. The bottom picture is further evolved, $\langle\delta^2\rangle^{1/2} \approx 1.5$, and we see triple-stream flow in the pancakes.


shows, it is not simply the peaks in the linear density field which survive as later

$$\delta(\mathbf{x},t) = b(t)\left(\lambda_1 + \lambda_2 + \lambda_3\right) = b(t)\,\nabla^2\Phi, \tag{3.67}$$

and therefore we have placed deformation information into the equation from the beginning. The clearest counter-example to Eulerian linear theory is the case where $\lambda_1 > 0$ yet $\sum_i \lambda_i < 0$. There is pancake formation in a region which would become increasingly underdense according to linear evolution.

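For an initial $\delta_0(\mathbf{q})$ field which is Gaussian (so the initial potential, velocity, and density fields are all Gaussian), with $\sigma_0^2 = \langle\delta_0^2\rangle$, Doroshkevich found the joint probability distribution for the set

$$\lambda_1 > \lambda_2 > \lambda_3 \tag{3.68}$$

of eigenvalues of the deformation tensor $\Phi_{ij}$,

$$P(\lambda_1,\lambda_2,\lambda_3) = \frac{675\sqrt{5}}{8\pi\sigma_0^6}\,(\lambda_1 - \lambda_2)(\lambda_1 - \lambda_3)(\lambda_2 - \lambda_3)\,\exp\left[-\frac{3I_1^2}{\sigma_0^2} + \frac{15\,I_2}{2\sigma_0^2}\right]. \tag{3.69}$$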

This implies that the probability that one value is positive while the other two are negative (which is equal to the probability that two are positive and one is negative) is $\approx 42\%$. The probability that all three are positive (which is the same as that for all three negative) is $\approx 8\%$. In a purposefully naive approach, we might ignore the ordering (3.68) and that the $\lambda_i$ are dependent (since they come from the $\Phi_{ij}$ eigenvalue problem) random variables. Then we would estimate that the probability that all the $\lambda_i$ are negative is simply $(1/2)^3 = 12.5\%$. That the answer (8%) is less is due to the property of the repulsion of eigenvalues (Zel'dovich 1983).
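These fractions are easy to reproduce by Monte Carlo sampling. The sketch below is an illustration rather than Doroshkevich's calculation: it assumes the standard covariance of the second derivatives of an isotropic Gaussian field with $\nabla^2\Phi = \delta_0$, namely $\langle\Phi_{ij}\Phi_{kl}\rangle = (\sigma_0^2/15)(\delta_{ij}\delta_{kl} + \delta_{ik}\delta_{jl} + \delta_{il}\delta_{jk})$, and the function name and sample size are ours.

```python
import numpy as np

def sample_eigenvalues(n_samples, sigma0=1.0, seed=0):
    """Eigenvalues of Gaussian random symmetric tensors Phi_ij.

    Assumed covariance: <Phi_ij Phi_kl> = (sigma0^2 / 15) *
    (d_ij d_kl + d_ik d_jl + d_il d_jk), so that the trace has variance sigma0^2.
    """
    rng = np.random.default_rng(seed)
    common = rng.normal(size=(n_samples, 1))      # shared part of the diagonal
    indep = rng.normal(size=(n_samples, 3))
    offd = rng.normal(size=(n_samples, 3)) / np.sqrt(15.0)
    diag = np.sqrt(2.0 / 15.0) * indep + np.sqrt(1.0 / 15.0) * common
    phi = np.zeros((n_samples, 3, 3))
    for i in range(3):
        phi[:, i, i] = diag[:, i]
    phi[:, 0, 1] = phi[:, 1, 0] = offd[:, 0]
    phi[:, 0, 2] = phi[:, 2, 0] = offd[:, 1]
    phi[:, 1, 2] = phi[:, 2, 1] = offd[:, 2]
    return sigma0 * np.linalg.eigvalsh(phi)       # ascending order per sample

lam = sample_eigenvalues(200_000)
n_positive = (lam > 0.0).sum(axis=1)
fractions = np.bincount(n_positive, minlength=4) / lam.shape[0]
print(fractions)             # fractions with 0, 1, 2, 3 positive eigenvalues
print(1.0 - fractions[0])    # fraction of q-space with lambda_1 > 0
```

With a few hundred thousand samples the estimates settle near 8%, 42%, 42%, and 8%, so that about 92% of q-space has $\lambda_1 > 0$.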

As explained by Shandarin and Zel'dovich (1989), the high density regions in the evolved, non-Gaussian density field percolate, despite the fact that the dense regions themselves comprise but a small volume of space. Eventual sheets, filaments, and clusters correspond to $\lambda_1 > 0$ regions comprising $\approx 92\%$ of q-space, by the above calculation by Doroshkevich. The percolation of the deformation properties in q-space goes over into x-space, since the $\mathbf{x}(\mathbf{q},t)$ mapping is smooth. Consider the smoothness of the initial conditions and the agreement of the expansion time scale with the instability time scale. These two points, along with the asymmetric nature of gravitational instability, are combined in the Zel'dovich approximation to yield cosmological pancakes. The continuity/percolation argument above then gives the cellular structure; further details of such an argument are provided by Sahni, Sathyaprakash, and Shandarin (1994). Hierarchical structure formation allows for small scales to be nonlinear, while large scales can usefully be analyzed with linear theory. It is interesting that approximations to gravitational instability, with suitable filtering, are able to bridge the gap, somewhat, between the linear and nonlinear regimes.

3.3  Perturbation  Theory 

3.3.1 Perturbation Theory in the Zel'dovich Approximation

The Zel'dovich approximation has its own perturbation theory, similar in structure to the full gravitational equations. Within the confines of this approximation, there is no need to provide a perturbation series in $\mathbf{v}$, the peculiar velocity, because, naturally, the relation $\mathbf{v} = a\dot b\,\mathbf{u}$ for the material coordinate velocity field holds throughout. The Eulerian and Lagrangian density fields are expanded, based on the exact mass conservation relation $1 + \delta = J^{-1}$ and the $\mathbf{x}(\mathbf{q},t)$ mapping, and we may obtain the Eulerian field

$$\delta(\mathbf{x},t) = \delta^{(1)} + \delta^{(2)} + \cdots. \tag{3.70}$$


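perturbatively from the start,

$$\Psi(\mathbf{q},t) = \Psi^{(1)} + \Psi^{(2)} + \cdots, \tag{3.71}$$

$$\delta = \cdots + O(b^3) = \delta^{(1)} + \delta^{(2)} + O(b^3), \tag{3.73}$$

$$f(\mathbf{x}) = f(\mathbf{q}) + b\,u^i\,\partial_i f(\mathbf{q}) + O(b^2),$$

$$\delta^{(2)} = b^2\left[\frac{1}{2}\left(\frac{\partial u^i}{\partial q^i}\right)^2 + u^i\,\frac{\partial}{\partial q^i}\left(\frac{\partial u^j}{\partial q^j}\right) + \frac{1}{2}\frac{\partial u^i}{\partial q^j}\frac{\partial u^j}{\partial q^i}\right] = \frac{1}{2}\left(\delta^{(1)}\right)^2 + b\,\Phi_{,i}\,\delta^{(1)}_{,i} + \frac{b^2}{2}\,\Phi_{,ij}\Phi_{,ij}. \tag{3.77}$$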


(3.78) 


(3.79) 


— *v,.  «*;■" 


The $d^3x$ integral in (3.85) gives the usual momentum conserving delta function, $[\delta_D] = [\delta_D(\mathbf{k} - \sum_i \mathbf{k}_i)]$.


3.3.2 Other Approximations

While  the  Zel'dovich  approximation  is  used  in  this  work,  there  are  of  course  other 
variations  in  the  quest  to  approximate  gravitational  instability  beyond  the  linear 
regime.  Many  of  the  various  methods  are  reviewed  by  Sahni  and  Coles  (1995). 

The Zel'dovich mapping plus gluing will at least qualitatively address the shell crossing problems of the approximation (Zel'dovich 1983). Gurbatov, Saichev, and Shandarin (1989) formalized such a method, creating the elegant Adhesion model. With the addition of a small viscosity term to the Euler equation, pancake structures are maintained due to a thin viscous boundary layer. The pancake → filament → clump evolutionary sequence (when all the $\Phi_{ij}$ eigenvalues are positive) is maintained, and structures such as voids evolve in an environment leading to super-pancakes and super-filaments (Kofman et al. 1992). The method is beautifully described by Sahni et al. (1994) and Sahni and Coles (1995).

If we use the formalism of Lagrangian perturbation theory from the beginning, using the perturbation expansion (3.71) for the $\mathbf{x}(\mathbf{q},t)$ mapping, we obtain what are sometimes called the post-Zel'dovich approximation (second order) and the post-post-Zel'dovich approximation (third order). The material coordinate equations (3.24) and (3.25) need to be solved order by order in an iterative manner. To first order the solution for $\Psi^{(1)}$ is separable, leading to the Zel'dovich mapping. As shown by Bouchet et al. (1992) and Bouchet et al. (1995), the higher order solutions are separable if $\Omega = 1$ and nearly so when $\Omega \ne 1$, and the solution has remarkably little $\Omega$ dependence. The second order $\Psi^{(2)}$ solution for $\Omega = 1$ is determined by

$$\nabla_q\cdot\Psi^{(2)} = -\frac{3}{7}\,b^2\,I_2, \tag{3.89}$$

which gives the $\Psi^{(2)}$ part of the map since it is a gradient. Adding the (3.89) contribution to the $1/J$ expansion (3.72) for the density gives an Eulerian $\delta^{(2)}$ of the same form as equation (3.77), but the $\frac{1}{2}$, $\frac{1}{2}$ coefficients do get shifted to the exact values of $\frac{5}{7}$, $\frac{2}{7}$. To illustrate the near separability in the $\Omega \ne 1$ case, we note that the $\frac{1}{2}$, $\frac{1}{2}$ values go to $\frac{1}{2}(1 \pm \kappa)$, where $\kappa$ is well approximated by $\frac{3}{7}\,\Omega^{-2/63}$. The Zel'dovich approximation perturbation theory displays the same structure as the exact theory. Combine this with the filtered approximation's favorable cross correlations with N-body results (Coles et al. 1993), and we have motivation for further numerical experiments.


CHAPTER  4 
SIMULATIONS 

4.1  Numerical  Realizations 

The Lagrangian mappings of material coordinate perturbation theory are particularly suited for numerical investigations using computers. The peculiar gravitational potential is modeled as a Gaussian random field initially, while fluctuations are small. This implies that the initial density fluctuation and peculiar velocity fields are also Gaussian. As we illustrated in Chapters 2 and 3, the potential $\phi$, density contrast $\delta$, and the velocity $\mathbf{v}$ are related by the equations of motion. By specifying the $\delta$ field as a simulated, pseudo-random Gaussian variable with specified 2-point spatial correlations, we have the first order fields which we may evolve in time according to perturbation theory. We go further by extrapolating the fluctuations into the nonlinear regime using the Zel'dovich approximation.

We  discuss  aspects  of  the  numerical  methods  below.  We  show  the  evolution  of 
simulations  into  the  nonlinear  regime,  and  show  the  agreement,  during  the  early 
stages  of  evolution,  with  linear  theory.  To  close  the  chapter,  we  present  pictures 
which  give  a visual  impression  of  the  dependence  of  the  final  matter  distribution  on 
the  initial  correlations  and  also  to  any  biasing  between  the  matter  and  the 




First, we clarify the definition of the linear fields and their relation in Fourier space. The density contrast is

$$\delta(\mathbf{x},t) = b(t)\,\delta_0(\mathbf{x}). \tag{4.1}$$

The rescaled gravitational potential $\Phi$ is given by

$$\Phi = \frac{\phi_0}{4\pi G\bar\rho\,a^3}, \tag{4.2}$$

where $\phi_0$ is the initial peculiar gravitational potential. The linear solutions are related by

$$\nabla^2\Phi = \delta_0, \tag{4.3}$$

$$\mathbf{u} = \mathbf{v}/a\dot b = -\nabla\Phi. \tag{4.4}$$

The density fluctuations are specified in k-space, where the covariance function is diagonal. Since the fluctuations are independently distributed for different $\mathbf{k}$ values, it is straightforward to lay down a $\delta_0(\mathbf{k})$ field on a simulation grid in practice. The power spectrum $P_0(k)$, as we introduced it in Chapter 2, will specify the 2-point correlations, and this determines $\delta_0$ completely, since all higher order connected moments vanish. The x-space and k-space relation is


(4.5)


(4.6) 


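$$\langle\delta(\mathbf{k})\,\delta(\mathbf{k}')\rangle = [\delta_D(\mathbf{k} + \mathbf{k}')]\,P(k), \tag{4.7}$$

$$P(k) = \langle|\delta(\mathbf{k})|^2\rangle. \tag{4.8}$$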


The initial potential and velocity in k-space are

$$\Phi(\mathbf{k}) = -\frac{\delta_0(\mathbf{k})}{k^2}, \qquad \mathbf{u}(\mathbf{k}) = \frac{i\mathbf{k}}{k^2}\,\delta_0(\mathbf{k}). \tag{4.9}$$

From the Fourier transform definitions, we see that the local variance of each linear field is proportional to an integral moment of the power spectrum,

$$\sigma_m^2 \propto \int d^3k\; k^{2m}\,P(k), \tag{4.10}$$

with the $m = 0$, $-1$, and $-2$ moments corresponding to the variances of $\delta$, $\mathbf{v}$, and $\phi$, respectively. For the scaled initial fields $\delta_0$, $\mathbf{u}$, and $\Phi$ we express this as

$$\langle\delta_0^2\rangle \propto \int d^3k\;P_0(k), \tag{4.11}$$

$$\langle u^2\rangle \propto \int d^3k\;k^{-2}\,P_0(k), \tag{4.12}$$

$$\langle\Phi^2\rangle \propto \int d^3k\;k^{-4}\,P_0(k). \tag{4.13}$$

If we wish to model the large scales by the Zel'dovich scale free spectrum, where potential fluctuations are the same on all scales, we choose $P_0 \sim k$, so that the $\Phi$ variance is scale free.

We implement a Lagrangian mapping on a discrete $n \times n \times n$ grid representing the comoving coordinates. A uniform grid represents initial material coordinates $\mathbf{q}$. After time evolution, the simulated data is analyzed with another grid of $\mathbf{x}$ coordinates. Fourier transforms are taken in the discrete sense, as noted in Chapter 2. To optimize the discrete Fourier transform, which is computed with the Fast Fourier Transform (FFT) algorithm, the grid size $n$ is a power of 2 such as 64 or 128. The fundamental cube is taken to be $L \times L \times L$ in comoving dimension. With grid spacing $\Delta x = L/n$, the grid coordinates are $\mathbf{x} = \mathbf{x}_{int}\,\Delta x$, where the $\mathbf{x}_{int}$ components are integers in the $0, 1, \ldots, n-1$ range. The comoving wavenumber $\mathbf{k}$ is associated with an $n \times n \times n$ lattice in k-space, with grid spacing $\Delta k = 2\pi/L$, i.e. $\mathbf{k} = \mathbf{k}_{int}\,\Delta k$ with integer $\mathbf{k}_{int}$ components in the range $0, 1, \ldots, n-1$, which maps to the range $-n/2+1, \ldots, n/2$ by the periodicity of the discrete series. The reality condition on the x-space fields and the FFT symmetries imply that there are a total of $n^3$ independent real and imaginary components in the $k_z > 0$ half of k-space, so the x-space and k-space fields contain the same quantity of information. The highest frequency (called the critical or Nyquist frequency), associated with an integer wavevector component of $n/2$, is $k_{nyq} = \pi/\Delta x$, and this corresponds to a wavelength of $2\Delta x$; i.e. critical sampling is two points per wave. At best, our frequency range in k-space will be limited by wavelengths of $2\pi/k = L$, the fundamental mode, to $2\pi/k = 2\Delta x$, the critical sampling length.
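The grid bookkeeping above can be made concrete with NumPy's FFT helpers. In this sketch the box size `L = 100.0` is a hypothetical value chosen for illustration; note that `numpy.fft.fftfreq` places the Nyquist component at $-n/2$ rather than $+n/2$, which is equivalent by the periodicity of the discrete series.

```python
import numpy as np

n = 64                      # grid points per side (a power of 2)
L = 100.0                   # hypothetical comoving box size
dx = L / n                  # grid spacing in x-space
dk = 2.0 * np.pi / L        # grid spacing in k-space (fundamental mode)

# Integer wavenumbers: 0, 1, ..., n/2 - 1, -n/2, ..., -1
k_int = np.fft.fftfreq(n, d=1.0 / n)
k = k_int * dk              # comoving wavenumber components

k_nyq = np.pi / dx          # Nyquist (critical) wavenumber
print(k_int.min(), k_int.max())            # -32.0 31.0
print(2.0 * np.pi / dk, 2.0 * np.pi / k_nyq)   # wavelengths L and 2*dx
```

The usable wavelength range therefore runs from `L` down to `2 * dx`, as stated in the text.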

For purposes of the discrete numerical work, it proves convenient to work with the grid coordinates $\mathbf{x}_{int}$ and $\mathbf{k}_{int}$ directly, and we omit the $int$ subscripts. The $x$-space to $k$-space relation is

$$\delta(\mathbf{x}) = \sum_{\mathbf{k}} \delta(\mathbf{k})\, e^{i 2\pi \mathbf{k}\cdot\mathbf{x}/n},$$

$$\langle \delta(\mathbf{k})\,\delta(\mathbf{k}') \rangle = [\delta_D(\mathbf{k}+\mathbf{k}')]\, P(k),$$

where the $[\ldots]$ delta function is either 0 or 1. We normalize the initial $\delta_0$ field such that the $b(t)\,\delta_0(\mathbf{x})$ first order evolution corresponds to the measured $\sigma^2 = \langle\delta^2\rangle$ and $P(k) = \langle|\delta(\mathbf{k})|^2\rangle$ statistics in the linear regime. Such normalization doesn't require any adjustments; we need only ensure that the linear perturbation theory relations among the fields hold true on the grid. We normalize the $\int d^3k\, k^{-2} P_0(k)$ integral out of convenience, since it is directly related to the Zel'dovich mapping. The velocity expansion on the grid, which satisfies $\nabla \cdot \mathbf{u} = -\delta_0$, is




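$$\mathbf{u}(\mathbf{x}) = \sum_{\mathbf{k}} \frac{i\mathbf{k}}{k^2}\, \delta_0(\mathbf{k})\, e^{i 2\pi \mathbf{k}\cdot\mathbf{x}/n}. \tag{4.17}$$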


on the grid is

$k$-space deformations (the FFT of $\psi$) are

*.<#)  = 


$$\int d^dk = \int 4\pi k^2\,dk, \quad \int 2\pi k\,dk, \quad \int 2\,dk \qquad (d = 3, 2, 1) \tag{4.22}$$


$$\delta_0(\mathbf{k}) = \sqrt{-\ln r_1\; P_0(k)}\;\, e^{i 2\pi r_2} \tag{4.23}$$


at each point $\mathbf{k}$ on half of the grid. In a series of simulations, by using the same initial seed in the pseudo-random sequence which covers $k$-space, we may generate realizations with the same initial phases but differing power spectra. Likewise, we may obtain multiple realizations of the same initial power spectrum, if each simulation starts with a different seed.
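A minimal sketch of this seeding strategy follows, assuming two uniform random numbers per mode give the amplitude and phase in the way equation (4.23) suggests. The function name `gaussian_field`, the `spectrum` callable, and the use of the real part of the inverse FFT to impose the reality condition (rather than filling only half of the grid) are illustrative choices, not the procedure actually used for the simulations.

```python
import numpy as np

def gaussian_field(n, spectrum, seed):
    """Real 3-d Gaussian field on an n^3 grid from a seeded random sequence.

    spectrum: callable giving P0 as a function of integer wavenumber
    magnitude (a hypothetical helper for this sketch).
    """
    rng = np.random.default_rng(seed)
    k1 = np.fft.fftfreq(n, d=1.0 / n)
    kx, ky, kz = np.meshgrid(k1, k1, k1, indexing="ij")
    k = np.sqrt(kx**2 + ky**2 + kz**2)
    r1 = 1.0 - rng.random((n, n, n))      # uniform in (0, 1], so log(r1) is finite
    r2 = rng.random((n, n, n))            # uniform phase variable
    power = np.zeros_like(k)
    power[k > 0] = spectrum(k[k > 0])     # no power in the k = 0 (mean) mode
    delta_k = np.sqrt(-np.log(r1) * power) * np.exp(2j * np.pi * r2)
    # The real part of the inverse transform is equivalent to
    # Hermitian-symmetrizing delta_k, which enforces a real x-space field.
    return np.fft.ifftn(delta_k).real

# Same seed, different spectra: identical phases, different amplitudes.
field_k1 = gaussian_field(16, lambda k: k * (k <= 6), seed=1)
field_flat = gaussian_field(16, lambda k: 1.0 * (k <= 6), seed=1)
# Same spectrum, different seed: an independent realization.
field_other = gaussian_field(16, lambda k: k * (k <= 6), seed=2)
```

Because the random numbers are drawn in the same order regardless of the spectrum, the Fourier phases of `field_k1` and `field_flat` agree mode by mode, while `field_other` is statistically equivalent to `field_k1` but uncorrelated with it.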

4.2 Linear and Nonlinear Scales

We now show the evolution from the linear to the nonlinear regimes. We first show the transition to nonlinear fluctuations in two dimensions, simply because the visual effect is clearest. Simulations in one and three dimensions follow. Although 3-d simulations have the least resolution (typically a $128^3$ point grid), they will be emphasized the most throughout, since the universe is three-dimensional.

The linearly evolved rms local density contrast $\sigma_l$ at the grid scale is

$$\sigma_l^2 = b^2(t) \int d^3k\, P_0(k). \tag{4.24}$$


Note that here, and below, we may substitute $d^dk$ for $d^3k$ if needed, as in equation (4.22). Let us use the condition $\delta \approx 1$ to specify the onset of nonlinearity at a length scale $k_{nl}^{-1}$. This leads to the following definition for $k_{nl}$,

$$b^2(t) \int_0^{k_{nl}} d^3k\, P_0(k) = 1. \tag{4.25}$$

The lower limit 0 here could in principle refer to a comoving horizon scale; practically, the fundamental mode $k = 1$ defines the lower limit in the sums over $k$ modes. Small scales, $k > k_{nl}$, have gone nonlinear at time $b(t)$ given by (4.25). The nonlinear scale $k_{nl}$ is a useful measure of time evolution, in that $k_{nl}$ decreases with time; i.e. as time goes on, larger and larger scales become nonlinear. The linear $b^2(t) P_0(k)$ time evolution of the power spectrum will hold for $k < k_{nl}$.



The nature of the Zel'dovich approximation and aspects of hierarchical clustering accessible by high-$k$ filtering are illustrated in Figures 4-1 through 4-5, which show the matter distribution for a series of two-dimensional simulations. The initial power spectrum is $P_0 \sim k$ with a sharp cutoff at $k_c$.

We use a cutoff $k_c = 10$ and show the matter distribution at five epochs, defined by $\sigma_l$ = 0.1, 0.5, 1, 1.5, and 2 according to (4.24). Statistical signals may be detected for data with $\sigma_l < 0.1$, but we do not display such data since they appear nearly uniform to the eye. The five epochs are shown in the top panels of Figures 4-1 through 4-5. The evolution from the grid to a hint of structure is shown in the first two figures. Figure 4-3 shows the typical Zel'dovich approximation extrapolation, as we have $k_{nl} = k_c$. The last two show the approximation pushed a bit too far. In the lower panels, the same five $b(t)$ times are used, but the filtering has been performed at the nonlinear scale. The five $k_{nl}$ values are 100, 20, 10, 7, and 5. At each successive epoch, larger scales are nonlinear. The $\sigma_l$, $b(t)$, and $k_{nl}$ values for the fixed filter $k_c = 10$ sequence are provided in Table 4-1. The corresponding evolution parameters in the variable filter $k_c = k_{nl}$ sequence are shown in Table 4-2.

The $k_{nl}$ parameter indicates the degree of evolution. For simulations with different initial power spectra, we evolve to the same $k_{nl}$ in order to have analogous evolution. The meaning of $k_{nl}$ is more robust in the case of full $N$-body simulations, since high-$k$ filtering is not required as it is in the Zel'dovich approximation. Yet we still find it to be useful in our simulations. Note that the $k_{nl}$ values in Table 4-1 are not the same as those in Table 4-2 even though both tables describe $P_0 \sim k$ spectra; this is simply because the six different pre-filters we use in Table 4-2 amount to six different spectra. Specifically, our normalized initial spectra are obtained from unnormalized $P_u(k)$, as can be seen in equation (4.19), by


Figure 4-1. Two dimensional evolution, part 1. This is the first in a series of five plots (4-1 through 4-5) which show evolution from an initial $P_0 \sim k$ spectrum. The top plots have a fixed cutoff $k_c = 10$, while the bottom plots use $k_c = k_{nl}$ (cutoff at the nonlinear scale). Top: $\sigma_l = 0.1$. Bottom: $k_{nl} = 100$.


Figure 4-2. Two dimensional evolution, part 2. Top: $\sigma_l = 0.5$. Bottom: $k_{nl} = 20$.

Figure 4-3. Two dimensional evolution, part 3: $\sigma_l = 1$ and $k_{nl} = 10$.

Figure 4-4. Two dimensional evolution, part 4. Top: $\sigma_l = 1.5$. Bottom: $k_{nl}$

Figure 4-5. Two dimensional evolution, part 5. Top: $\sigma_l = 2$. Bottom: $k_{nl}$


Figure 4-6. Growth of unit cell density fluctuations with time, in two dimensions. The dashed line is the 2-d linear perturbation theory result. The solid line and data points are the measured $\langle\delta^2\rangle^{1/2}$ results from a 2-d simulation.


Table 4-1 Linear evolution parameters for $P_0 \sim k^1$ with $k_c = 10$ in 2-d.

    $\sigma_l$    $b$              $k_{nl}$
    0.01          2.62 x 10^-4     215
    0.1
    0.5           0.0131           16
    1             0.0262           10
    1.5
    2


Table 4-2 Linear evolution parameters for $P_0 \sim k^1$ with $k_c = k_{nl}$ in 2-d.

    $\sigma_l$    $b$              $k_{nl}$
    1             2.62 x 10^-4     > 256
    1             2.62 x 10^-3     100
    1             0.0131           20
    1             0.0262           10
                  0.0523


where $P_u(k)$ has been filtered. We use isotropic pre-filters, $\delta(\mathbf{k}) \to \delta(\mathbf{k})\,W(k, k_f)$, with $W(k, k_f) = 1$ for small $k$. The $P_u(k)$ therefore already includes a $|W|^2$ factor. In the present instance, the spectra differ only in their sharp cutoff filtering scale $k_f$, but that is a significant difference, as can be seen in the figures. Choosing $k_f = k_{nl}$ is natural but not strictly required. If $k_f > k_{nl}$, then in general the Zel'dovich approximation is pushed too far, as shell crossing begins to wipe out the clustering. If $k_f \ll k_{nl}$, then the approximation is well within the single-stream region and there is not much clustering. We take the $P_0$ in the integrand of equation (4.25) to be $C P_u(k)$ but without any filter, though this last distinction is only relevant in the cases where $k_f < k_{nl}$ (slight evolution). For instance, it allows us to take the scaling for $P_0 \sim k^n$ (in $d$ dimensions)

$$\sigma_l^2 = \left(\frac{k_f}{k_{nl}}\right)^{n+d} \tag{4.27}$$

to be valid for various $k_f/k_{nl}$ choices. For our fully evolved simulations, we take $k_f \approx k_{nl}$.
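As a quick consistency check on the power-law scaling, one can assume that a sharp cutoff at $k_c$ and an unfiltered spectrum in the definition of $k_{nl}$ give $\sigma_l^2 = (k_c/k_{nl})^{n+d}$ in the continuum limit, and invert this for $k_{nl}$. The function name below is hypothetical and the relation is an assumption of this sketch; with $n = 1$, $d = 2$ and $k_c = 10$ it returns values that round to the $k_{nl}$ entries 215, 16 and 10 listed in Table 4-1.

```python
def k_nl_from_sigma(sigma_l, k_c, n, d):
    """Nonlinear wavenumber for P0 ~ k^n with sharp cutoff k_c in d dimensions,
    assuming the continuum scaling sigma_l^2 = (k_c / k_nl)^(n + d)."""
    return k_c * sigma_l ** (-2.0 / (n + d))

# Fixed-cutoff 2-d sequence with P0 ~ k and k_c = 10
for sigma_l in (0.01, 0.5, 1.0):
    print(sigma_l, round(k_nl_from_sigma(sigma_l, k_c=10, n=1, d=2)))
```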

The lower sequence of plots in Figures 4-1 through 4-5 shows a nontrivial hierarchical evolution of structure formation. Full $N$-body simulations do not need initial high-$k$ filtering. The fortunate fact is that the $N$-body results at successive epochs correlate well with $k_{nl}$ filtered Zel'dovich approximation matter distributions, provided the final clumpy $N$-body distribution is smoothed (Coles et al. 1993). The correlations are more favorable for spectral index values $n$ = 0, -1, and -2, which contain more initial small-$k$ power. Interestingly, there is a piece of empirical evidence which helps explain the perhaps unexpected usefulness of a hierarchical pancaking model. The fully evolved $N$-body gravitational potential $\varphi$ is found to resemble the $k_{nl}$ smoothed initial $\varphi_0$ more than it resembles the unfiltered $\varphi_0$ (Pauls & Melott 1995). This at least indicates how the Zel'dovich approximation moves matter to the right place, overall.



The transition from the linear to the nonlinear regime is displayed in Figure 4-6, where we show the increase of the local rms fluctuations with time. The dashed curve in Figure 4-6 is $\sigma_l$ vs. $b(t)$ as given by equation (4.24). The unit cell fluctuations, measured from a sequence of $P_0 \sim k$ simulations on a $512^2$ grid, agree with linear theory early on, as they must. We use cloud-in-cell weighting to approximate a fine matter distribution on the simulation grid. On a $512^2$ grid, using $k_c = 10$ on our spectrum, the mean displacement $d_{rms}$ is only $\approx 1.3$ unit cells when $\sigma_l = 0.1$, for example, so there are still some vestiges of grid effects prior to that. The nonlinearities inherent in the Zel'dovich approximation (mode coupling) cause the disagreement with linear theory beginning at fluctuation amplitudes of $\approx 0.3$. The peak (and particularly the decrease after the peak) in Figure 4-6 illustrates the breakdown (shell crossing) of the approximation. The criterion $\sigma_l = 1$ for a fully extrapolated Zel'dovich approximation is simple yet effective. The farthest one could reasonably push the approximation, corresponding to the peak, is $\sigma_l = 1.2$ in this instance.
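Cloud-in-cell weighting, mentioned above, spreads each particle over its nearest grid points with linear weights. The sketch below is a generic 2-d version on a periodic grid, with grid points taken at integer coordinates; the function name and that convention are assumptions for illustration rather than the code used for Figure 4-6.

```python
import numpy as np

def cic_density_2d(pos, n):
    """Cloud-in-cell counts on a periodic n x n grid.

    pos: (N, 2) array of particle positions in grid units, in [0, n).
    Each particle contributes a total weight of 1, shared among the four
    surrounding grid points.
    """
    counts = np.zeros((n, n))
    i0 = np.floor(pos).astype(int)       # lower-left grid point of each particle
    f = pos - i0                         # fractional offsets in [0, 1)
    for dx in (0, 1):
        for dy in (0, 1):
            # weight is (1 - f) toward the lower point and f toward the upper one
            w = np.abs(1 - dx - f[:, 0]) * np.abs(1 - dy - f[:, 1])
            np.add.at(counts, ((i0[:, 0] + dx) % n, (i0[:, 1] + dy) % n), w)
    return counts
```

The resulting counts are generally non-integer, and a density contrast follows as `counts / counts.mean() - 1`.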

Similar considerations hold in the case of one-dimensional simulations. We illustrate a spectrum $P_0 \sim k^2$ with a cutoff $k_c = 10$ on a 32,768 cell 1-d grid. The hierarchical effect is shown in higher resolution in Figures 4-7 through 4-9. The top $1+\delta$ vs. $x$ plots show the $\sigma_l$ = 0.1, 1, and 2 evolution (with $k_c = 10$). The bottom plots are fully evolved with the cutoff at $k_{nl}$, using the same three $b(t)$ values. The unit cell rms fluctuations, shown in Figure 4-10, agree with the $\sigma_l \sim b(t)$ 1-d perturbation theory early on, and the nonlinearities show up for $\langle\delta^2\rangle^{1/2} \approx 0.1$-$1$, but this local statistic becomes unreliable after that due to shell crossing. However, $\langle\delta^2\rangle^{1/2}$ does continue to increase with $b(t)$ until slightly past $\sigma_l = 1$, and such a nonlinearity criterion still proves useful. Incidentally, a double hump feature at shell crossing is a typical effect in 1-d simulations (using a finer grid or averaging over multiple realizations generally will not remove such an effect, but smoothing will remove it).


Figure 4-7. One dimensional evolution, part 1. This is the first of three plots (4-7 through 4-9) which show evolution from an initial $P_0 \sim k^2$ spectrum. The top plots have a fixed cutoff $k_c = 10$, while the bottom plots use $k_c = k_{nl}$ (cutoff at the nonlinear scale). Top: $\sigma_l = 0.1$. Bottom: $k_{nl} = 100$.

Figure 4-8. One dimensional evolution, part 2: $\sigma_l = 1$ and $k_{nl} = k_c = 10$.

Figure 4-9. One dimensional evolution, part 3. Top: $\sigma_l = 2$. Bottom: $k_{nl} = 5$.

Figure 4-10. Growth of unit cell density fluctuations with time, in one dimension. The dashed line is the 1-d linear perturbation theory result. The solid line and data points are the measured $\langle\delta^2\rangle^{1/2}$ results from a 1-d simulation.


We are obliged to focus on three dimensional simulations. On a $128^3$ cell grid, we begin with $P_0 \sim k^{-1}$ correlations and a cutoff at $k_c = 10$. The evolution of structure is shown in Figures 4-11 through 4-13, which show slices from a sequence of 3-d simulations. In each case, the slice thickness is $L/64$ (where $L$ is the cube dimension). The evolution of fluctuations with time is shown in Figure 4-14.

There is more large-scale power visible in the slices in Figures 4-11 through 4-13 than in the two-dimensional simulations shown earlier. The primary reason is that we have chosen to illustrate a $P_0 \sim k^{-1}$ spectrum, which decreases with $k$. The secondary reason is that we are using a 2-d slice as our means of visualization. The density field covers an entire cube, yet we cut through a thin slice for our plots. Three dimensional structures forming perpendicular to the slice, e.g. the plane of a pancake coinciding with the plane of the slice, augment the power measured in two dimensions. The extra large-scale power is not always visibly clear in one-dimensional pencil beam sub-samples or two-dimensional slice sub-samples of a three dimensional field. However, we find that the extra small-$k$ power in estimators confined to 1-d or 2-d is noticeable compared to a 3-d $P(k)$ estimator.

These projection effects can be estimated as follows. We have a full three dimensional realization with density field $\delta(\mathbf{x})$, and we excise a thin slice in the $xy$-plane as our sub-sample, with the 2-d density denoted by $\delta'(\mathbf{x})$. The power in the sub-sample is defined by

$$\langle \delta'(\mathbf{q}_1)\,\delta'(\mathbf{q}_2) \rangle_{2d} = [\delta_D(\mathbf{q}_1 + \mathbf{q}_2)]\, P_{2d}(q_1), \tag{4.28}$$

where the 2-d wavevectors are denoted by $\mathbf{q}$. If we compute the left hand side of (4.28), confining $\mathbf{x}$ to the $z = 0$ plane, we find that $P_{2d}(q)$ is obtained directly from the full 3-d $P(k)$, but that $\mathbf{k}$ vectors perpendicular to the slice are contributing. Thus,


Figure 4-11. Three dimensional evolution, part 1. This is the first of three plots (4-11 through 4-13) which show evolution from an initial $P_0 \sim k^{-1}$ spectrum. Each plot shows a slice of thickness $L/64$ from a simulation on a $128^3$ grid. The top plots have a fixed cutoff $k_c = 10$, while the bottom plots use $k_c = k_{nl}$ (cutoff at the nonlinear scale). Top: $\sigma_l = 0.5$. Bottom: $k_{nl} = 23$.

Figure 4-12. Three dimensional evolution, part 2: $\sigma_l = 1$ and $k_{nl} = 10$.

Figure 4-13. Three dimensional evolution, part 3. Top: $\sigma_l = 2$. Bottom: $k_{nl} = 4$.

Figure 4-14. Growth of unit cell density fluctuations with time. The dashed line is the linear perturbation theory result. The solid line and data points are the measured $\langle\delta^2\rangle^{1/2}$ results from a 3-d simulation.


$$P_{2d}(q) = \int dk_z\, P\!\left[(q^2 + k_z^2)^{1/2}\right]. \tag{4.29}$$


pencil beam (which we take to be on the line $y = z = 0$). In analogy with the above,

$$\langle \delta'(q_1)\,\delta'(q_2) \rangle_{1d} = [\delta_D(q_1 + q_2)]\, P_{1d}(q_1), \tag{4.30}$$

where 1-d wavevectors are denoted by $q$. If we take a 1-d sub-sample from a 2-d realization of a density field with known power $P_{2d}$, we obtain

$$P_{1d}(q) = \int dk_y\, P_{2d}\!\left[(q^2 + k_y^2)^{1/2}\right]. \tag{4.31}$$

More relevantly, we compute the left hand side of (4.30), assuming a 1-d sub-sample of a full 3-d realization, and we obtain

$$P_{1d}(q) = \int_q^{\infty} 2\pi k\, dk\, P(k). \tag{4.32}$$

These slice and pencil effects can be viewed as $k$-space projections in the following sense. In the case of the slice, we have a 3-d $\delta(\mathbf{x})$ field, and we multiply it by a function which is zero but for a thin slice near $z = 0$. In $k$-space, the $\delta(\mathbf{k})$ field is then convolved along $k_z$ with a broad function. The above formulae are simple, since we multiplied $\delta(\mathbf{x})$ by a delta function (which has 1 as its transform). If $W(z)$ or $W(y, z)$ are the functions which produce a slice or pencil of finite thickness, then the $P(k)$ integrals in (4.29) and (4.32) would change according to

$$P(k)\,dk \;\to\; P(k)\, \left|W\!\left[(k^2 - q^2)^{1/2}\right]\right|^2 dk, \tag{4.33}$$




where, for example, we have

$$W(k_z) = \frac{\sin(k_z d/2)}{k_z d/2} \tag{4.34}$$

for a slice of thickness $d$ in the $xy$-plane.
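The slice projection can be explored numerically. The sketch below assumes that, up to normalization, the slice power is $P_{2d}(q) = \int dk_z\, P[(q^2 + k_z^2)^{1/2}]\,|W(k_z)|^2$ with a top-hat slice window $W = \sin(k_z d/2)/(k_z d/2)$. The function name, the integration range, and the toy spectrum in the usage lines are illustrative assumptions.

```python
import numpy as np

def projected_power_2d(q, power_3d, thickness, kz_max=200.0, nk=20001):
    """Riemann-sum estimate of the slice power at 2-d wavenumber q.

    power_3d: callable P(k) for the full 3-d spectrum (hypothetical helper).
    thickness: slice thickness d; thickness = 0 gives the delta-function slice.
    """
    kz = np.linspace(-kz_max, kz_max, nk)
    # np.sinc(x) = sin(pi x) / (pi x), so this is sin(kz d / 2) / (kz d / 2)
    window = np.sinc(kz * thickness / (2.0 * np.pi))
    integrand = power_3d(np.sqrt(q**2 + kz**2)) * window**2
    return np.sum(integrand) * (kz[1] - kz[0])

# Toy spectrum that vanishes as k -> 0 in 3-d ...
toy = lambda k: k * np.exp(-k**2 / 2.0)
# ... but whose thin-slice projection stays finite at q = 0.
p2d_at_zero = projected_power_2d(0.0, toy, thickness=0.0)
```

For this toy spectrum the 3-d power goes to zero at small $k$ while the thin-slice power at $q = 0$ does not, which is the extra large-scale power discussed above. A finite `thickness` suppresses the contribution from large $|k_z|$.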

Actual redshift surveys are often in the form of beams or slices, so these issues are relevant in practice (Kaiser & Peacock 1991) and must be accounted for so as not to overestimate large-scale power. When implementing Lagrangian approximations, we find that it is best to use full 3-d simulations and then excise sub-samples if needed. While it is true that the properties of the initial gravitational potential are closely tied to the final caustic locations, in one, two, or three dimensions, we find that the difference in the evolution of clustering between the 1-d, 2-d, and 3-d cases indicates that 3-d simulations are the way to go. This is somewhat obvious, but it needed to be investigated given the potential convenience of simulating $P_{1d}$ and $P_{2d}$ spectra directly. Anyway, our main focus is on full 3-d samples, and the slices we show are simply a helpful visual tool.

To investigate how far one may usefully push the approximation, we measured the local fluctuations $\sigma = \langle\delta^2\rangle^{1/2}$ versus $b(t)$ for various initial spectra in three dimensions. For $P_0 \sim k^n$, $-3 \le n \le 1$, we show the evolution of the local fluctuations in Figure 4-15. The linear theory result is also shown for the $n = -3$ case, plotted using dashed curves. The maximum evolution which could possibly be useful corresponds to the peaking of the curve at $b_{max}$, with $\sigma(b_{max}) = 2.6$ for $n = -3$ and $\sigma(b_{max}) = 2.0$ for $n = 1$. These maximum points are marked with asterisks. While perturbative results are in force, one would expect the $\sigma$ vs. $b(t)$ curves to possess a positive second derivative $d^2\sigma/db^2$, since a perturbative expansion would necessarily involve powers $\sigma \sim b^p$ (beyond linear order $p = 1$). The $\sigma$ vs. $b$ inflection points are marked with triangles. Inflection points in $\sigma^2$ vs. $b$ could also be used, with similar results (such points occur slightly later). The simplest criterion, based on extrapolating linear theory to



its upper limit, is $\sigma_l = 1$, and these points are plotted with solid squares in Figure 4-15. Using sharp $k_c$ filtering in the Zel'dovich approximation, choosing $\sigma_l = 1$ is the same as setting $k_c = k_{nl}$. The curves in Figure 4-15 display a scaling according to $\sigma_l \propto b(t)$. This is shown in Figure 4-16, where we plot $\sigma$ vs. $\sigma_l$ for the same five data sets. Note that for the scaling to be accurately portrayed in Figure 4-16, one must compute the integrals used in the definitions of $P_0(k)$ and $\sigma_l$ as discrete sums $\sum_{\mathbf{k}}$ rather than integrals $\int 4\pi k^2\,dk$. Using integral expressions is generally fine, leading to errors on the order of a percent, but especially for indices $n < 0$, discrepancies will show up in direct comparisons with simulations.
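The difference between lattice sums and continuum integrals can be checked directly. The sketch below compares $\sum_{\mathbf{k}} k^n$ over integer wavevectors with $0 < |\mathbf{k}| \le k_c$ against $\int_0^{k_c} 4\pi k^{2+n}\,dk$; the function name and the choice $k_c = 12$ in the usage lines are illustrative assumptions.

```python
import numpy as np

def sum_vs_integral(n_index, k_c):
    """Lattice sum of k^n over 0 < |k| <= k_c versus the continuum integral.

    The continuum expression 4 pi k_c^(n+3) / (n+3) requires n > -3.
    """
    m = int(np.floor(k_c))
    k1 = np.arange(-m, m + 1)
    kx, ky, kz = np.meshgrid(k1, k1, k1, indexing="ij")
    k = np.sqrt(kx**2 + ky**2 + kz**2)
    inside = (k > 0) & (k <= k_c)              # fundamental mode is the lower limit
    discrete = np.sum(k[inside] ** float(n_index))
    continuum = 4.0 * np.pi * k_c ** (n_index + 3) / (n_index + 3)
    return discrete, continuum

for n_index in (1, 0, -1):
    discrete, continuum = sum_vs_integral(n_index, 12)
    print(n_index, discrete / continuum)
```

The printed ratios show how closely the two agree for each index on a lattice of this size.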

The Zel'dovich approximation can be pushed to rms fluctuation values of $\sigma \approx 2$-$3$ before the approximation breaks down via shell crossing in the simulations. The high density structures (depending on the grid resolution) can reach $\delta_{max} \approx 10^1$-$10^2$. Such a dynamic range is smaller than that of a full $N$-body simulation, but it provides a useful probe into the nonlinear regime under controlled circumstances. Also, early epochs (where all scales remain in the linear regime) may be simulated so that we may witness the transition to nonlinearity. Generally, in the remaining pictures shown in this chapter and in the statistical analyses of Chapter 5, we concentrate on fully evolved Zel'dovich simulations. We let the scale of filtering $k_f$ coincide with the scale of nonlinearity $k_{nl}$ and push the approximation to its limits. Even then, the scales with $r \gtrsim k_{nl}^{-1}$ exhibit linear behavior, and the quasi-linear regime (which bridges the linear and highly nonlinear regimes) may be investigated.

We now illustrate two of the most basic inputs into any cosmological simulation. First, there is the initial power spectrum $P_0(k)$, which completely specifies the initial


Figure 4-15. Measured growth of density fluctuations for various power indices in 3-d simulations. The leftmost (dotted) curve corresponds to $n = 1$. Successive curves to the right apply to $n$ = 0, -1, -2, -3. For $n = -3$ (dashed curve) we also plot the linear theory result.


Figure 4-16. Measured growth of density fluctuations, scaled to $\sigma_l$. The same data used in Figure 4-15 are plotted using $\sigma_l \propto b$. The linear perturbation theory result $\sigma = \sigma_l$ (dashed line) is also shown.


spatial correlations for the Gaussian field $\delta_0(\mathbf{x})$. Secondly, there is the biasing prescription which determines the nature of the galaxy sample to be analyzed. If we vary the bias or the initial power between two simulations, sometimes the distinction between them is visually clear. On the other hand, sometimes two different simulations look about the same and, though different, are difficult to distinguish clearly due to noise in the statistical measures. Most importantly, there are cases where relatively small changes in the initial parameters are clearly recognizable in the final statistical estimators. The 3-point function (presented in the following chapter) is an important example of a potentially discriminating statistic.

We assume that we have an evolved matter distribution, with density $\rho = \bar\rho(1+\delta)$, and that, based on this, there is a galaxy distribution. The number density of the galaxy distribution is given by $n_g = \bar{n}(1 + \delta_g)$, and we must assume that there is at least some functional relationship between $\delta(\mathbf{x})$ and $\delta_g(\mathbf{x})$. As an approximate model, we use a local bias,

$$\delta_g(\mathbf{x}) = f(\delta(\mathbf{x})). \tag{4.35}$$

Even if the function $f$ is not known, $\delta$ and $\delta_g$ might be close enough after smoothing that a few terms in a Taylor expansion

$$\delta_g = \sum_{j=0}^{\infty} \frac{b_j}{j!}\,\delta^j \tag{4.36}$$

will suffice. We have $\langle\delta_g\rangle = \langle\delta\rangle = 0$, and $b_0$ is used to ensure this zero average; $b_0$ plays no role in the statistical moments. The simplest case is linear bias, where $\delta_g = b_1\delta$, and such an effect is straightforward to trace through perturbation theory results. For example we have $P(k) \to b_1^2 P(k)$. The linear $\mathbf{v}$ field in terms of $\delta_g$, from equations (2.103) and (2.109) in §231.4, is

(4.37)


which contains the typical combination $\beta = \Omega^{0.6}/b_1$ of the $\Omega$ and $b_1$ parameters. For example, averaging over a spherical overdensity of radius $R$ (as in early Virgo infall estimates), we have an infall velocity $v$ given by

A sharp threshold bias, where $\delta$ is clipped to zero for values below $\delta_t$, is an example of a nonlinear biasing procedure. For an underlying Gaussian field, the threshold model has been investigated analytically (Kaiser 1984, Politzer & Wise 1984, Bardeen et al. 1986, Jensen & Szalay 1986). For underlying local rms fluctuations $\sigma_0$ and threshold $\nu = \delta_t/\sigma_0$, Kaiser (1984) found, for $\nu > 1$ and $\sigma_0 < 1$, a biased correlation function $\xi_g = b_\nu^2\,\xi/\sigma_0^2$ and $b_\nu = \nu$. Jensen and Szalay (1986) extended this result for $b_\nu$ to include small $\nu$:


$$\ldots = \mathrm{erfc}(\nu/\sqrt{2}), \tag{4.39}$$

where $F \to 1$ for large $\nu$ and $F \to \ldots$ for small $\nu$. A factor $\ldots$ acts as a linear bias parameter $b_1$, and one might expect measured power to be amplified by such a $b_1^2$ factor in simulations, though our evolved density fields are not Gaussian (we find this to be so, qualitatively, in Chapter 5).

Going to second order in perturbation theory (e.g. as needed for the determination of the 3-point function), we need to include the first nonlinear bias term so that the biasing (4.36) is carried out to the same order as the perturbation theory. A modified hierarchical structure for the $N$-point functions persists even with such a bias carried out to all orders (Fry & Gaztañaga 1993). Keeping up to $b_2$ in the bias,

$$\delta_g = b_0 + b_1\delta + \tfrac{1}{2} b_2 \delta^2, \tag{4.40}$$

where $b_0 = -b_2\langle\delta^2\rangle/2$. We show the biasing effect on the first few local density moments. Recall that the connected moments are

$$\langle\delta^2\rangle_c = \langle\delta^2\rangle, \qquad \langle\delta^3\rangle_c = \langle\delta^3\rangle, \qquad \langle\delta^4\rangle = \langle\delta^4\rangle_c + 3\langle\delta^2\rangle^2. \tag{4.41}$$

Using this order of bias, we have

$$\langle\delta_g^2\rangle = b_1^2\langle\delta^2\rangle + b_1 b_2\langle\delta^3\rangle + \frac{b_2^2}{2}\langle\delta^2\rangle^2 + \frac{b_2^2}{4}\langle\delta^4\rangle_c, \tag{4.42}$$

where the second term shows the small effects of nonlinear bias on the second moment, and the last two terms are actually superfluous unless one also includes a $b_3$ bias term. For the third moment, we find

$$\langle\delta_g^3\rangle = b_1^3\langle\delta^3\rangle + 3 b_1^2 b_2\langle\delta^2\rangle^2. \tag{4.43}$$

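The reduced hierarchical amplitude is thus

$$\frac{\langle\delta_g^3\rangle}{\langle\delta_g^2\rangle^2} = \frac{1}{b_1}\,\frac{\langle\delta^3\rangle}{\langle\delta^2\rangle^2} + \frac{3 b_2}{b_1^2} \tag{4.44}$$

to this order in perturbation theory and nonlinear bias. We utilize a similar effect on the Fourier space 3-point correlations in the next chapter. To get a handle on the biasing of our simulation data sets, we utilize these biasing parameters $b_1$ and $b_2$.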
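The second-moment expression for a quadratic local bias can be verified numerically on any zero-mean sample, since it is an algebraic identity once $b_0 = -b_2\langle\delta^2\rangle/2$ is imposed. The sketch below uses a lognormal toy sample and arbitrary bias values; both are assumptions chosen for illustration and are not simulation data.

```python
import numpy as np

rng = np.random.default_rng(0)
delta = rng.lognormal(mean=0.0, sigma=0.5, size=200_000)  # skewed toy sample
delta = delta / delta.mean() - 1.0                        # enforce <delta> = 0

b1, b2 = 1.5, 0.4                                         # illustrative bias parameters
m2, m3, m4 = (np.mean(delta**p) for p in (2, 3, 4))
m4c = m4 - 3.0 * m2**2                                    # connected fourth moment

b0 = -0.5 * b2 * m2                                       # keeps <delta_g> = 0
delta_g = b0 + b1 * delta + 0.5 * b2 * delta**2

lhs = np.mean(delta_g**2)
rhs = b1**2 * m2 + b1 * b2 * m3 + 0.5 * b2**2 * m2**2 + 0.25 * b2**2 * m4c
```

Here `lhs` and `rhs` agree to rounding error, independent of the distribution used for the toy sample.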

In practice, to implement a bias we impose a Poisson, or Monte Carlo, realization of the evolved density field. The simulated density field is already discrete, but it has as high a resolution as is practical; the resulting galaxy distribution is much more diluted or discrete. Without bias, galaxies are simply laid out with a probability proportional to the local density $\rho(\mathbf{x})$; i.e. in practice we can take the discrete matter distribution from the simulation and simply dilute it uniformly at random. For example, if there are $\approx 10^6$ simulation particles, we simply eliminate 99% of them randomly if we want $\approx 10^4$ galaxies in our simulated sample. Using this unbiased procedure, the galaxy statistics should be the same as the full $\delta(\mathbf{x})$ statistics, and while there is measurable signal, we find this to be true (if we account for discreteness effects). To implement bias, the Poisson procedure is of course explicitly dependent on the local $\delta(\mathbf{x})$ value. Favorite mechanisms include some sort of binary threshold, whereby a cell contains a galaxy if $\delta > \delta_c$ and a cell is empty otherwise, for some cutoff $\delta_c$. One direct method is to keep or reject particles of the simulation according to some predefined probability $\mathcal{P}(\delta)$; for example, we may use a power law in $\rho$.
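Both selection schemes reduce to drawing one uniform random number per particle. The following sketch shows uniform dilution and a simple threshold rule; the function names, the `p_below` parameter, and the specific numbers in the usage lines are hypothetical choices for illustration.

```python
import numpy as np

def dilute(n_particles, keep_fraction, rng):
    """Unbiased dilution: keep each particle with the same probability."""
    return rng.random(n_particles) < keep_fraction

def threshold_bias(delta, delta_t, p_below, rng):
    """Keep every particle whose local delta is at or above delta_t, and keep
    the others with probability p_below (p_below = 0 is a binary threshold)."""
    prob = np.where(delta >= delta_t, 1.0, p_below)
    return rng.random(delta.shape) < prob

rng = np.random.default_rng(42)
mask = dilute(10**6, 0.01, rng)          # roughly 10^4 of 10^6 particles survive
local_delta = np.array([-0.5, 0.2, 1.0, 3.5])
kept = threshold_bias(local_delta, delta_t=1.0, p_below=0.0, rng=rng)
```

A density-dependent probability, such as a power law in $\rho = 1 + \delta$, would replace `p_below` with an array evaluated at each particle.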

Some insight into the effects of the initial $P_0$ spectrum can be obtained by inspecting slices from 3-d simulations, when there are variations in the initial power with a coherence scale corresponding to $k_c = 12$. Figures 4-17 through 4-23 show a set of realizations with the same initial phases but differing spectra. In each case we evolve the simulation fully, to $\sigma_l = 1$, corresponding to $k_{nl} = k_f$ since we truncate sharply. The slices are of thickness $L/64$, from $128^3$ grid simulations. In each figure, we plot the linearly evolved spectrum $P = b^2(t) P_0$ below the slice. We begin by showing minimal $P_0 \sim k^4$ fluctuations, in Figure 4-17. A Zel'dovich $P_0 \sim k$ spectrum is used in Figure 4-18. There are strong cross correlations in terms of the major structures, though the measured 2-point statistics are clearly different. This trend continues in Figures 4-19 and 4-20, but there is more large-scale power, since the spectrum is a flat $P_0$ = constant in the first case and constant from $k = 6$ to $k = 12$ (denoted $P_0 \sim$ hat(6, 12) since this is a "top-hat" spectrum) in the second. Three negative power indices, $P_0 \sim k^n$ with $n$ = -1, -2, and -3, are shown in Figures 4-21 through 4-23. Such a simulation does not correspond to a fair sample. Measured power from redshift surveys has turned over from $n = 1$ to $n \approx -1, -2$ on $50\,h^{-1}$ Mpc scales, so negative indices are quite relevant. Because the initial phases are the same, some of the same structures persist also in these figures.

Melott, Pellman, and Shandarin (1994) showed that the cross correlations between the Zel'dovich approximation and $N$-body simulations are improved if the $k$-space


Figure 4-17. Power spectrum $P_0 \sim k^4$, sharp cutoff $k_c = 12$.

Figure 4-18. Power spectrum $P_0 \sim k$, sharp cutoff $k_c = 12$.

Figure 4-19. Power spectrum $P_0$ = constant, sharp cutoff $k_c = 12$.

Figure 4-20. Power spectrum $P_0 \sim$ hat(6, 12).

Figure 4-21. Power spectrum $P_0 \sim k^{-1}$, sharp cutoff $k_c = 12$.

Figure 4-22. Power spectrum $P_0 \sim k^{-2}$, sharp cutoff $k_c = 12$.

Figure 4-23. Power spectrum $P_0 \sim k^{-3}$, sharp cutoff $k_c = 12$.

Figure 4-24. Power spectrum $P_0 \sim k$, Gaussian cutoff $k_G = 12$.


Table 4-3 Evolution parameters for $P_0 \sim k^n$ with sharp $k_{nl}$ cutoff.


truncation is performed with a Gaussian filter. The truncation idea is simply that

$$\delta_0(\mathbf{k}) \to \delta_0(\mathbf{k})\, W(k, k_f) \tag{4.45}$$

for some isotropic filter which suppresses $k \gtrsim k_f$ power. The Gaussian filter is given by

$$W = \exp(-k^2/2k_G^2). \tag{4.46}$$

The simplest, yet effective (Coles et al. 1993), filter is the sharp step function, $W(k, k_c) = 0$ for $k > k_c$. For any $k_{nl}$ (Melott et al. (1994) found few differences between $k_{nl} = 4$ and $k_{nl} = 8$, for example) a good filter is $k_G \approx 1.25\,k_{nl}$ for $P_0 \sim k^n$ with $n$ = -2, -1, and 0; and $k_G \approx k_{nl}$ for $n = 1$. For example, using $P_0 \sim k$ and $k_{nl} = 12$, as in Figure 4-18, we choose a Gaussian window with $k_G = 12$ and plot the results in Figure 4-24. This correlates well with Figure 4-18 and with a full $k$ (no cutoff) $N$-body simulation (after smoothing).
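The two truncation filters, and the rule of thumb for the Gaussian scale quoted above, can be written compactly. The function names are hypothetical, and `gaussian_scale` simply encodes the quoted choices for $n \le 0$ and $n = 1$; it is not meant to cover other indices.

```python
import numpy as np

def sharp_filter(k, k_c):
    """Step-function window: 1 for k <= k_c, 0 for k > k_c."""
    return np.where(k <= k_c, 1.0, 0.0)

def gaussian_filter(k, k_G):
    """Gaussian window W = exp(-k^2 / (2 k_G^2))."""
    return np.exp(-k**2 / (2.0 * k_G**2))

def gaussian_scale(k_nl, n_index):
    """Gaussian scale for P0 ~ k^n: 1.25 k_nl for n <= 0, k_nl for n = 1."""
    return 1.25 * k_nl if n_index <= 0 else k_nl

# Filtered amplitudes would be delta_0(k) * W(k, k_f); the power picks up |W|^2.
k = np.arange(1.0, 33.0)
w_sharp = sharp_filter(k, 12)
w_gauss = gaussian_filter(k, gaussian_scale(12, -1))
```

For $k_{nl} = 12$ and a negative index this gives a Gaussian scale of 15, which leaves some power beyond the sharp cutoff while tapering it smoothly.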

Simulation sequences using both sharp and Gaussian cutoffs have been performed for $k^n$ spectra. For our $k_{nl}$ filtering, we take $k_c = k_{nl}$ for the top hat filter. For the Gaussian case, we take $k_G = k_{nl}$ for $n = 1$ and $k_G = 1.25\,k_{nl}$ for $n \le 0$. For a given index $n$ being filtered by either a top hat or a Gaussian, the evolution as marked by $b(t)$ is practically the same (the two $b(t)$ values differ by < 1% generally). The Gaussian case possesses more small scale power but also allows slightly more shell crossing. As shown by Melott et al. (1994), the compromise between these two effects works out nicely regarding $N$-body comparisons. In Table 4-3 we show evolution parameters for $P_0 \sim k^n$ spectra with sharp $k_{nl}$ filtering. The $k_{nl}$, $b(t)$, and $\sigma_l$ columns are linear theory parameters, whereas $\sigma = \langle\delta^2\rangle^{1/2}$ is the measured rms


For statistical purposes, the matter distribution is characterized by the
$\delta(\mathbf{x})$ array on the simulation grid. In practice, we have a
counts-in-unit-cells array, cuc(x), evaluated at the $128^3$ grid points using
cloud-in-cell weighting (so that the counts need not be integers). For example, with
$P_0 \sim k^{-1}$ and Gaussian filtering, $k_G = 15$ for $k_{nl} = 12$, we extract a
thin $L/128$ slice corresponding to one layer of the cuc array. This is shown in
Figure 4-25. In the gray-scale used in the plot, we have clipped density contrast
values above 9, although values over 30 existed in the slice plotted. We do this
because underdense regions dominate the distribution (the median $\delta$ value is
$\approx -0.5$, and the $\delta$ distribution has positive skewness). It may appear a
bit coarse, but recall that there are 2,097,152 elements in the full 3-d cuc array.
In the analyses, we may sharpen the measured spectra by deconvolution to recover the
resolution lost in cloud-in-cell binning (for the $\delta(\mathbf{x})$ field) or in
discrete, integer binning (for the $\delta_g(\mathbf{x})$ field). This is generally
not a large effect (cf. Chapter 5). Figure 4-26 shows galaxy distribution slices
stemming from the $\delta(\mathbf{x})$ in Figure 4-25. Each slice (of thickness
$L/64$) has $\approx 3{,}000$ galaxies, as a subsample of $\approx 2 \times 10^5$
galaxies in the cube. Slice (a) is unbiased and has the same statistics as the
undiluted $\delta(\mathbf{x})$, not including the discreteness shot noise. In slices
(b), (c), and (d), we apply bias with a threshold $\delta_t$ and vary the bias
probability $\mathcal{P}(\rho)$. In (b), $\delta_t = 1$ and we take
$\mathcal{P} = 0.1$ for $\delta < \delta_t$. In (c), we have
$\mathcal{P} \sim \rho^{1/2}$ below $\delta_t = 1$. In (d), we have
$\mathcal{P} \sim \rho$ below $\delta_t = 3$. In Figure 4-27 we show two diluted
samples with $\mathcal{P} \sim \rho^{1/2}$. For the top slice, $\delta_t = 1$; the
bottom has $\delta_t = 3$. The galaxies in the simulation cube have been diluted to
$\approx 5 \times 10^4$, and each slice has $\approx 800$ galaxies. Shot noise is
significant here, but the measured 2-point and 3-point spectra can still be
distinguished and, with the help of perturbation theory, be used to analyze the
dynamics and biasing.
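The cloud-in-cell weighting mentioned above can be sketched in one dimension. This is only an illustrative assumption of how such binning is commonly done (each particle shares unit weight linearly between its two nearest grid points on a periodic grid), not the code used for the simulations; the function names are hypothetical.

```python
import numpy as np

def cic_counts_1d(positions, n_grid):
    """Cloud-in-cell counts on a periodic 1-d grid with points at 0, 1, ..., n_grid - 1.

    Positions are in grid units. Each particle contributes weight (1 - f) to the
    grid point on its left and f to the one on its right, where f is the
    fractional offset, so the counts need not be integers.
    """
    counts = np.zeros(n_grid)
    x = np.asarray(positions, dtype=float) % n_grid
    left = np.floor(x).astype(int)
    frac = x - left
    np.add.at(counts, left % n_grid, 1.0 - frac)
    np.add.at(counts, (left + 1) % n_grid, frac)
    return counts

def density_contrast(counts):
    """delta = counts / mean(counts) - 1."""
    return counts / counts.mean() - 1.0
```

The 3-d version applies the same linear weights along each axis, sharing a particle among its eight surrounding grid points.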




Figure 4-25. A $L/128$ thin slice from the $128^3$ cuc array corresponding to the
unbiased $\delta(\mathbf{x})$ field. Power spectrum $P_0 \sim k^{-1}$, Gaussian cutoff
$k_G = 15$, $k_{nl} = 12$.


[Figure 4-26 panels: (a) unbiased; (b) $\mathcal{P} = 0.1$, $\delta < 1$; (c) $\mathcal{P} \sim \rho^{1/2}$; (d) $\mathcal{P} \sim \rho$.]


Figure 4-26. Four galaxy distribution slices which are the same except for the
biasing, based on the $\delta(\mathbf{x})$ in Figure 4-25. Plot (a) has no biasing.
Different biasing schemes, with bias probability $\mathcal{P}$ below a threshold
$\delta_t$, are used for the other three plots.


Figure 4-27. Two dilute samples, derived from the matter distribution in Figure 4-25.
Each is based on a bias probability $\mathcal{P} \sim \rho^{1/2}$ below a threshold
$\delta_t$. Top: $\delta_t = 1$. Bottom: $\delta_t = 3$.


CHAPTER  5 
STATISTICS 

5.1 Introduction

The  statistical  analysis  of  galaxy  surveys  and  simulated  cosmological  data  is  an 
important  part  of  the  large-scale  structure  problem.  We  focus  on  2-point  and  3-point 
spatial  correlations  primarily  because  they  provide  practical,  measurable  descriptions 
of  observable  matter  distributions  and  because  there  are  perturbative  results  which 
apply to large-scale data. In principle, a full set of $N$-point correlations would be
needed to completely describe an evolved density field, but even the 2-point function
can have a noisy signal, and higher order correlations are smaller and have decreasing
signal to noise ratios. Recall that the 2-point function completely specifies the initial
Gaussian density field, as all higher order connected moments are equal to zero.
As the matter distribution evolves gravitationally, the power spectrum $P(k)$, the
Fourier space 2-point function, shows a strong imprint of the initial spectrum in the
linear regime and retains a noticeable imprint even as nonlinearities develop. The
bispectrum $B(k_1, k_2, k_3)$, the 3-point function in Fourier space, is the first
correlation function devoted solely to nonlinearities and proves to be a rich statistic.
For example, the $k_1$, $k_2$, $k_3$ triangle shape dependence probes the inherently
anisotropic nature of
gravitational  instability,  and  to  lowest  nonvanishing  order  in  perturbation  theory  the 
effects  of  clustering  and  bias  are  largely  separable  (Fry  1994b).  We  concentrate  on 
the  power  spectrum  and  bispectrum  in  analyzing  the  matter  and  (possibly  biased) 
galaxy  density  contrast  fields  gathered  from  simulations  similar  to  those  we  presented 
in  Chapter  4. 


The  Zel'dovich  approximation  allows  us  to  probe  the  mildly  nonlinear  regime  of 
structure  formation.  This  is  also  known  as  the  quasi-linear  regime  and  marks  the 
transition  between  the  linear  and  highly  nonlinear  stages  of  evolution. 

At early times, the local rms density contrast is small, $\sigma \ll 1$, and the
density fluctuation modes evolve independently according to first order perturbation
theory,

    $\delta(\mathbf{k}, t) = b(t)\,\delta_0(\mathbf{k})$    (5.1)

for all $\mathbf{k}$. The local growth of fluctuations is well described by the first
order relation $\sigma_l \sim b(t)$. At these early epochs, the linear theory applies
to all scales, and $\delta(\mathbf{x})$ remains a Gaussian field.

After significant evolution, we have $\sigma > 1$, and nonlinear evolution governs the
smaller scales while linear and quasi-linear regimes exist on larger scales. In general,
the rms density contrast $\sigma_R$ filtered at length scale $R$ is a decreasing
function of $R$. At later epochs, the large scales for which $\sigma_R \ll 1$ remain
in the linear regime, and small scales for which $\sigma_R > 1$ have gone nonlinear.
The condition $\sigma_R \approx 1$ marks the transition. We identify epochs in our
simulations by the wavenumber $k_{nl}$ which marks the nonlinear transition according
to extrapolated linear theory:

    $b^2(t) \int_0^{k_{nl}} P_0(k)\, d^3k = 1$.    (5.2)

In the quasi-linear regime (roughly $\sigma \lesssim 1$), the coupling of different
modes is important, especially for $k \gtrsim k_{nl}$, and higher order correlations
such as the bispectrum develop.
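As a worked example of the condition defining $k_{nl}$, read here as $b^2(t)\int_0^{k_{nl}} P_0\, d^3k = 1$, consider a pure power law $P_0 = A k^n$ with $n > -3$ and no filtering below $k_{nl}$. The amplitude $A$ is a hypothetical symbol introduced only for this sketch.

```latex
b^2(t) \int_0^{k_{nl}} A\,k^n \, 4\pi k^2 \, dk
  = \frac{4\pi A\, b^2(t)}{n+3}\, k_{nl}^{\,n+3} = 1
\quad\Longrightarrow\quad
k_{nl}(t) = \left[ \frac{n+3}{4\pi A\, b^2(t)} \right]^{1/(n+3)} .
```

Under these assumptions $k_{nl} \propto b^{-2/(n+3)}$, so the nonlinear wavenumber moves to larger scales (smaller $k_{nl}$) as the growth factor increases, which is why smaller $k_{nl}$ labels later epochs.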

The main goal in §5.2 is to measure 2-point and 3-point correlations in
Fourier space. The data to be analyzed are the matter and galaxy density contrast
fields, $\delta(\mathbf{x})$ and $\delta_g(\mathbf{x})$, from Zel'dovich approximation
simulations. We present initial results concerning the power spectrum in §5.2.1. The
effect of mode coupling is illustrated in §5.2.2. In §5.2.3 we present the reduced
bispectrum measurements. In §5.3 we investigate the power spectrum and reduced
bispectrum while changing the initial statistics and biasing procedures in the
simulations. The main importance of this section lies in the fact that we are probing
the reduced bispectrum within a controlled environment.

5.2 Measured Correlations

5.2.1 Power Spectrum

We present power spectrum results for the underlying matter distribution and
for various discrete samples. The simulations are performed in a cubical comoving
volume. This entire volume comprises our sample and is filled with an
$n \times n \times n$ grid (typically $n = 128$). The cloud-in-cell binned unbiased
$\delta(\mathbf{x})$ array represents the underlying matter distribution, which is
practically continuous (i.e. discreteness effects are generally not important in this
instance). The power spectrum is obtained from $\delta(\mathbf{k})$, the discrete
Fourier transform (FFT) of $\delta(\mathbf{x})$. The discrete "galaxy" samples are
analyzed on the same grid, with the galaxy density contrast denoted by
$\delta_g(\mathbf{x})$. Discreteness effects potentially are important for the galaxy
distribution. No simulated selection effects are imposed (the samples are volume
limited), and the cubical geometry is retained.

The raw (uncorrected) power is obtained by averaging $|\delta(\mathbf{k})|^2$ over all
wavevectors of fixed length $k \pm \frac{1}{2}\Delta k$. Wavevectors are expressed as
multiples of the fundamental mode, whose wavelength is $L$, the length of the cube.
For one realization, the number of modes averaged over is approximately
$2\pi k^2 \Delta k$. Since the grid size is $n \times n \times n$, our Nyquist
frequency is $n/2$ (typically 64 since we use $n = 128$). The $k$-space bin size for
$P(k)$ measurements is taken to be $\Delta k = 1$. The error bars shown in
measurements below are the statistical (standard deviation of the mean) 1-$\sigma$
errors. To obtain the clustering power signal $P(k)$, we include discreteness and grid
blurring corrections, described below.
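The shell averaging described above can be sketched with NumPy as follows. This is a minimal illustration under stated assumptions, not the analysis code itself: the function name is hypothetical, the Fourier amplitudes are normalized by the number of grid cells, and the error estimate treats half of the modes in each shell as independent because the field is real.

```python
import numpy as np

def raw_power_spectrum(delta):
    """Shell-averaged raw power <|delta(k)|^2> for a cubic n x n x n array.

    Wavevectors are integer multiples of the fundamental mode and the bin
    width is dk = 1, so shell k collects modes with |k| within 1/2 of k.
    """
    n = delta.shape[0]
    # delta(k) = (1/V) sum_x delta(x) exp(-i k.x), with V = n^3 grid cells.
    delta_k = np.fft.fftn(delta) / delta.size
    power = np.abs(delta_k) ** 2

    # Integer wavenumbers along each axis: 0, 1, ..., n/2 - 1, -n/2, ..., -1.
    freqs = np.fft.fftfreq(n, d=1.0 / n)
    kx, ky, kz = np.meshgrid(freqs, freqs, freqs, indexing="ij")
    shell = np.rint(np.sqrt(kx**2 + ky**2 + kz**2)).astype(int).ravel()

    counts = np.bincount(shell)
    mean = np.bincount(shell, weights=power.ravel()) / counts
    mean_sq = np.bincount(shell, weights=power.ravel() ** 2) / counts
    # Standard deviation of the mean; modes k and -k are not independent for a
    # real field, so counts / 2 modes are treated as independent (assumption).
    err = np.sqrt(np.maximum(mean_sq - mean**2, 0.0) / (counts / 2.0))

    k = np.arange(1, n // 2 + 1)  # report shells up to the Nyquist frequency
    return k, mean[k], err[k], counts[k]
```

The returned `counts` include both $\mathbf{k}$ and $-\mathbf{k}$, so they are roughly twice the $2\pi k^2 \Delta k$ independent modes quoted in the text.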

Results for initial power law spectra filtered at the nonlinear scale are shown in
Figures 5-1 through 5-8. The measured power spectra at four stages of sharply $k_{nl}$
filtered evolution are shown in Figures 5-1 through 5-4. The initial spectra are $k^n$
power laws, and results for indices $n = 1$, 0, $-1$, and $-2$ are plotted in each
figure. Figure 5-1 refers to the earliest stage, $k_{nl} = 32$, while $k_{nl}$ values
of 16, 8, and 4 in Figures 5-2 through 5-4 refer to increasingly later stages. Results
from the analogous simulations which used Gaussian high-$k$ filtering are shown in
Figures 5-5 through 5-8. As in the previous chapter, we use $k_G = k_{nl}$ for $n = 1$
and $k_G = 1.25 k_{nl}$ for $n \le 0$.
The spectra shown in Figures 5-1 through 5-8 were obtained from a single
realization, and the signal is still relatively sharp. We have averaged over 10
realizations in the $k_c = 16$ case, Figure 5-2, to illustrate how such averaging
reduces the statistical error bars and clarifies the signal. Below the nonlinear
filtering scale ($k \lesssim k_{nl}$), the spectra generally follow the linear
predictions. For $k \gtrsim 0.5 k_{nl}$, the transfer of power to smaller scales is
clearly shown. The linear normalization (obtained from the initial conditions) is
approximately correct. Higher order corrections affect the shape of $P(k)$ in the
linear regime, rather than merely changing the amplitude and leaking power to small
scales. Figure 5-2 also shows the minor differences between using $\int d^3k$ or the
strictly correct discrete sums $\sum_{\mathbf{k}}$ in computing the linear
normalization. Recall that the local variance $\sigma^2 = \langle\delta^2\rangle$ is
the integral of the power spectrum: the integral of $b^2 P_0$ is $\sigma_l^2$, while
the integral of measured $P$ gives the measured $\sigma^2$. At small scales, e.g.
$k \gtrsim 2k_{nl}$, we find that an effective power law $P \sim k^{-\beta}$ emerges
over a range in $k$ which approaches the Nyquist limit. The power law is approximately
$k^{-3}$. The least squares fits typically yield $\beta = 2.6$-$3.0$, independent


Figure 5-1. Measured power for initial $P_0 \sim k^n$ with sharp cutoff
$k_c = k_{nl} = 32$. The linear power is also plotted.

Figure 5-2. Measured power for initial $P_0 \sim k^n$, $k_c = k_{nl} = 16$. The power
was averaged over 10 realizations to clarify the signal. The linear power is also
plotted (the solid curves from $\int d^3k$ computations, the dashed using the strictly
correct discrete sums).

Figure 5-3. Measured power for initial $P_0 \sim k^n$, $k_c = k_{nl} = 8$. The linear
power is also plotted.

Figure 5-4. Measured power for initial $P_0 \sim k^n$, $k_c = k_{nl} = 4$. The linear
power is also plotted.

Figure 5-5. Measured power for initial $P_0 \sim k^n$ with Gaussian cutoff
$k_G \approx k_{nl} = 32$. The linear power is also plotted.

Figure 5-6. Measured power for initial $P_0 \sim k^n$, $k_G \approx k_{nl} = 16$. The
linear power is also plotted.


of the initial $k^n$ index. For example, the 8 runs with $k_{nl} = 4$ (4 different
indices $n$, 2 different filters) give $\beta = 2.89 \pm 0.04$, fitting from
$k = 16$-$48$. Fitting over the same $k$ range in the $k_{nl} = 8$ case gives
$\beta = 2.72 \pm 0.03$.

The power spectrum was measured at various times for fixed initial spectra. The
general expectation is that $P \sim b^2(t)$ for $k \lesssim k_{nl}$. For $k^1$ and
$k^{-1}$ initial spectra with a sharp cutoff $k_c = 8$, we plot the power spectrum in
Figures 5-9 and 5-10 for six stages of evolution: $\sigma_l = 0.125$, 0.25, 0.5, 1.0,
2.0, and 4.0. For $k < k_c$, the $b^2(t)$ scaling holds even to the unreasonably high
$\sigma_l = 4$ epoch. For $k > k_c$, $P \sim b^4(t)$ seems to be the dominant effect
early on. This is shown in Figure 5-11, where $P$ versus $\sigma_l$ is plotted for
fixed $k = 2$ and $k = 12$.

The discreteness or shot noise contribution to the power is $1/N$ for $N$ "galaxies."
Taking this into account, the raw power per mode signal is related to the clustering
2-point signal $P(k)$ by

    $\langle |\delta(\mathbf{k})|^2 \rangle = P(k) + \frac{1}{N}$.    (5.3)

This follows by analyzing the FFT sum used to generate $\delta(\mathbf{k})$,

    $\delta(\mathbf{k}) = \frac{1}{V} \sum_{\mathbf{x}} \delta(\mathbf{x})\, e^{-i\mathbf{k}\cdot\mathbf{x}}$,    (5.4)

where $V = L^3 = n^3$ is the volume. Using a grid counts array $m(\mathbf{x})$,


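    $\delta(\mathbf{x}) = \frac{V}{N}\, m(\mathbf{x}) - 1$,    (5.5)

so that

    $\delta(\mathbf{k}) = \frac{1}{N} \sum_{\mathbf{x}} m(\mathbf{x})\, e^{-i\mathbf{k}\cdot\mathbf{x}} - \frac{1}{V} \sum_{\mathbf{x}} e^{-i\mathbf{k}\cdot\mathbf{x}}$,    (5.6)

which for $\mathbf{k} \ne 0$ is just

    $\delta(\mathbf{k}) = \frac{1}{N} \sum_{\mathbf{x}} m(\mathbf{x})\, e^{-i\mathbf{k}\cdot\mathbf{x}}$.    (5.7)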

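Figure 5-9. Power $P(k)$ vs. $k$ at several epochs, for an initial spectrum
$\sim k^1$, $k_c = 8$. The six times are $\sigma_l = 0.125$, 0.25, 0.5, 1.0, 2.0, and
4.0 corresponding to the six curves, bottom to top from the left.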


Figure 5-10. Power $P(k)$ vs. $k$ at several epochs, for an initial spectrum
$\sim k^{-1}$, $k_c = 8$. The six times are $\sigma_l = 0.125$, 0.25, 0.5, 1.0, 2.0,
and 4.0 corresponding to the six curves, bottom to top from the left.

Figure 5-11. Power $P(k)$ at fixed $k$ versus $\sigma_l$. The initial spectrum is
$\sim k^n$, $k_c = 8$ with $n = 1$ and $n = -1$. In each panel we plot $P$ versus
$\sigma_l$ at $k = 2$ (top curve) and $k = 12$ (bottom curve). The dashed line has
slope 2, the dotted line slope 4.


If $m(\mathbf{x})$ is 0 or 1 and if $m(\mathbf{x})$ represents a uniform random
process, then the sum (5.7) is a random walk of $N$ steps of length $1/N$ in the
complex plane, and the power is

    $\langle |\delta(\mathbf{k})|^2 \rangle = \frac{1}{N}$.    (5.8)

Even with no clustering signal, this noise would be present. Discussions on
discreteness effects can be found in Peebles (1980), Baumgart and Fry (1991), and Fry,
Melott, and Shandarin (1994).
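The $1/N$ level can be checked numerically with a small sketch. It assumes points are dropped uniformly at random into grid cells and binned discretely; the $32^3$ grid, the seed, and the function name are illustrative choices made here to keep the example light, and multiple occupancy of a cell is allowed rather than restricting $m(\mathbf{x})$ to 0 or 1.

```python
import numpy as np

def poisson_raw_power(n_points, n_grid=32, seed=0):
    """Mean raw power over all k != 0 modes for uniformly random points."""
    rng = np.random.default_rng(seed)
    # Discrete binning: each point falls in one integer cell of the grid.
    cells = rng.integers(0, n_grid, size=(n_points, 3))
    m = np.zeros((n_grid, n_grid, n_grid))
    np.add.at(m, (cells[:, 0], cells[:, 1], cells[:, 2]), 1.0)
    # For k != 0: delta(k) = (1/N) sum_x m(x) exp(-i k.x).
    delta_k = np.fft.fftn(m) / n_points
    power = np.abs(delta_k) ** 2
    # Exclude the k = 0 mode, which just carries the mean density.
    return (power.sum() - power[0, 0, 0]) / (power.size - 1)
```

For example, `poisson_raw_power(1000) * 1000` should come out close to 1; agreement for any single shell of modes is noisier and improves with more realizations.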

Another natural candidate for discreteness corrections to the power would be
$1/N_{occ}$, where $N_{occ}$ is the number of occupied cells in the $m(\mathbf{x})$
array. If there were effectively $N/N_{occ}$ galaxies per cluster on the scale of the
grid cell, the random noise in the power signal of the clusters becomes $1/N_{occ}$.
A third possibility for a correction follows if we do not assume that
$\langle m^2 \rangle = \langle m \rangle$ in the computation of the
$\langle \delta(\mathbf{k})\,\delta(\mathbf{k}') \rangle$ sum, which leads to the
following expression for the raw (uncorrected) power,

+ (5.9) 
where $\xi$ is the spatial 2-point function. In the second term, which is what we call
the clustering power spectrum, we have
$\langle m(\mathbf{x})\, m(\mathbf{x}') \rangle = \bar{m}^2 (1 + \xi(|\mathbf{x} - \mathbf{x}'|))$.
The first term does reduce to $1/N$ and $1/N_{occ}$ in the respective limits.

To check the validity of the code which computes the $P(k)$ estimator, and to
validate our ideas on $1/N$ noise, we analyze data which are generated by a uniform
Poisson process. We place $N$ points into the sample volume with a pseudo-random
number generator and then bin them in the $128^3$ unit cells. For one realization of
$N = 1000$ and one of $N = 10000$, we have plotted the raw power spectrum in
Figure 5-12. The dashed lines indicate the expected $1/N$ signal. Convergence on the
expected value improves with multiple realizations. In the case of cloud-in-cell
binning, which we use only for the unbiased underlying matter distribution,
discreteness should not generally be relevant. Since
$128^{-3} \approx 5 \times 10^{-7}$, such a signal could play a role if one


«“*<->  = {J*  othe'rwiM^ 


    $S(k_x, k_y, k_z) = \mathrm{Sa}(\pi k_x/128)\, \mathrm{Sa}(\pi k_y/128)\, \mathrm{Sa}(\pi k_z/128)$.


Figure 5-12. Raw power spectrum of a uniform Poisson process with $N$ particles.
The dashed lines show the $1/N$ values. In the c-i-c case, the points follow $1/N$
modulated by $\mathrm{Sa}^4(\pi k/128)$, the dotted curve that can be seen for
$k > 50$ (where the


    $S^2(k_x, k_y, k_z) = \mathrm{Sa}^2(\pi k_x/128)\, \mathrm{Sa}^2(\pi k_y/128)\, \mathrm{Sa}^2(\pi k_z/128)$


Figure 5-13. Sharpening correction. The error bars show the maximum difference
between the isotropic approximation $\mathrm{Sa}(\pi k/128)$ and
$\mathrm{Sharp}(k_x, k_y, k_z)$. Even at the Nyquist frequency ($k = 64$) this error
is only 3%.

Figure 5-14. Degradation of power signal with dilution. Four samples from a
$P_0 \sim k^{-1}$, $k_c = k_{nl} = 16$ simulation are taken. The $N = 128^3$ panel
shows the linear spectrum as well as the underlying c-i-c binned and discretely binned
result; only for $k > 60$ can the discretely binned power be seen to dip. The
$N = 10^3$ signal is not useful for $k >$


Table 5-1. Power law indices for dilute samples: $k^\alpha$ for $k \lesssim k_{nl}$,
$k^\beta$ for $k \gtrsim k_{nl}$.

    $N$               $\alpha$           $\beta$
    $128^3$ (c-i-c)   -1.00 ± 0.05       -2.85 ± 0.01
    $128^3$           -1.00 ± 0.05       -2.75 ± 0.01
    $10^5$            -0.99 ± 0.05       -2.58 ± 0.04
    $10^4$            -0.98 ± 0.08       -2.46 ± 0.29
    $10^3$            -1.48 ± 0.23       -0.9 ± 0.7

We use self consistency as the criterion for choosing a discreteness correction, and
the $1/N$ choice generally proves best. Experiments which compare the variance of
the power signals as only $N$ is varied tend to bear this out.


5.2.2 Mode Coupling

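In §3.3.1, we obtained a perturbative expression for $\delta(\mathbf{k})$ which
explicitly displayed the coupling of modes (Grinstein & Wise 1987),

    $\delta(\mathbf{k}, t) = \sum_{n=1}^{\infty} b^n(t) \int d^3k_1 \ldots d^3k_n\, [\delta_D]\, G_n(\mathbf{k}_1, \ldots, \mathbf{k}_n)\, \delta_0(\mathbf{k}_1) \cdots \delta_0(\mathbf{k}_n)$

    $G_n(\mathbf{k}_1, \ldots, \mathbf{k}_n) = \frac{1}{n!} \left( \frac{\mathbf{k}\cdot\mathbf{k}_1}{k_1^2} \right) \left( \frac{\mathbf{k}\cdot\mathbf{k}_2}{k_2^2} \right) \cdots \left( \frac{\mathbf{k}\cdot\mathbf{k}_n}{k_n^2} \right)$    (5.16)

where we set $\mathbf{k} = \sum \mathbf{k}_j$ due to the $[\delta_D]$ constraint. We
apply results based on (5.16) to a few special cases below.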

The power spectrum is given by

    $\langle \delta(\mathbf{k})\, \delta(\mathbf{k}') \rangle = [\delta_D(\mathbf{k} + \mathbf{k}')]\, P(k)$.    (5.17)

The product $\langle \delta(\mathbf{k})\, \delta(\mathbf{k}') \rangle$ is computed
order by order, accounting for the scaling of the perturbative series,
$\delta^{(n)} \sim (\delta^{(1)})^n$. We utilize the formalism of Fry (1984), Goroff
et al. (1986), and Fry (1994a). Since $\delta^{(1)} = b(t)\,\delta_0$ is a Gaussian
field, all its odd statistical moments vanish, so the first corrections to $P(k)$ are
$\sim (\delta^{(1)})^4$ and therefore involve second and third order perturbation
theory. Explicitly factoring out the $b(t)$ dependence, we have

    $P(k) = b^2 P_{11} + b^4 [\, 2P_{13} + P_{22} \,] + b^6 [\, 2P_{15} + 2P_{24} + P_{33} \,] + \ldots$

    $\langle \delta^{(m)}\, \delta^{(n)} \rangle = [\delta_D]\, b^{m+n} P_{mn}$.    (5.18)

Note that $P_{11} = P_0$. Fry (1994a) computes terms in this series and applies it to
order $b^6$, which utilizes fifth order theory, in the case of the exact dynamics. The
first nonlinear corrections are

    $P_{13} = 3 P_0(k) \int d^3k'\, G_3(\mathbf{k}, \mathbf{k}', -\mathbf{k}')\, P_0(k')$

    $P_{22} = 2 \int d^3k'\, G_2^2(\mathbf{k}', \mathbf{k} - \mathbf{k}')\, P_0(k')\, P_0(|\mathbf{k} - \mathbf{k}'|)$.    (5.19)

These two terms represent the 1-loop corrections to the 2-point correlations, as
investigated by Scoccimarro and Frieman (1996), who extensively develop perturbative
calculations beyond leading order (i.e. beyond the tree level contributions). See Fry
(1984), Goroff et al. (1986), and Scoccimarro and Frieman (1996) for discussions of
a diagrammatic approach to perturbative calculations of $N$-point correlations.

For the Zel'dovich kernel (5.16) we compute these corrections in the case of a top
hat initial spectrum, $P_0 \sim \mathrm{hat}(k_1, k_2)$. The function
$\mathrm{hat}(k_1, k_2)$ is 1 for $k_1 < k < k_2$ and 0 otherwise. Such a spectrum
contains no initial power at large scales ($k < k_1$) or small scales ($k > k_2$) and
so provides an environment to display mode coupling explicitly. A $k^4$ small-$k$ tail
(Zel'dovich 1965) should be detectable, since there is no linear power to dominate it.

Zel'dovich (1965) and Peebles (1974b, 1980) describe how nonlinearities induce a
$P \sim k^4$ spectrum on large scales when no large-scale power is initially present.
Fry (1994a) shows how this behavior persists to higher orders in perturbation theory.
In the case of a homogeneous distribution of particles, with interparticle length
scale $x_0$, simply displacing the mass will generate a minimal power signal as
$k \to 0$. Naively, by looking at the $\delta(\mathbf{k})$ transform (5.4), one would
obtain $\delta(\mathbf{k}) \sim k x_0$ for small $k$, but in fact such a term vanishes
due to momentum conservation. The next term gives
$\delta(\mathbf{k}) \sim (k x_0)^2$, which supplies the minimal $k^4$ spectrum. In our
3-d c-i-c binned simulations, this signal is marginally observable (due to limited
dynamic range), as we show below. Note that this effect is observed without c-i-c
discreteness corrections, and such a signal can be detected far below the $1/N$ shot
noise level. In 2-d or 1-d simulations, the effect is more easily observed.

We perform the 1-loop corrections for the case

    $P_0 = C\, \mathrm{hat}(k_1, 2k_1)$.    (5.20)

The results depend on the combination $C k_1^3$, and we fix $C$ by our
$\int d^3k\, P_0(k) = 1$ simulation normalization, equation (4-19). This gives
$C = 3/(28\pi k_1^3)$, and the local linear $\langle \delta^2 \rangle$ fluctuations
are

    $\sigma_l^2 = \int d^3k\, b^2 P_0(k) = b^2(t)$.    (5.21)
For the case $P_0 \sim \mathrm{hat}(1, 2)$, $\sigma_l = 0.1$, the individual
contributions to $P(k)$ up to second order are plotted in Figure 5-15. The $P_{13}$
contribution is negative ($\sim -k^2$) for $k_1 < k < k_2$ and zero outside this
interval; the absolute value of $b^4 P_{13}$ is the dashed curve in Figure 5-15. On
large scales, the $P_{22}$ term provides a $k^4$ tail as $k \to 0$, though this bends
over near $k_1$. Specifically, taking the small $k$ limit of $P_{22}$, we obtain

    $b^4 P_{22} \to \frac{\pi C^2 b^4 k^4}{5 k_1} = \frac{9\, b^4 k^4}{3920\, \pi\, k_1^7}$.    (5.22)

This asymptote is plotted as the dot-dashed line in Figure 5-15, and the full
$b^4 P_{22}$ term is drawn as the solid curve. For $k_1 < k < k_2$, $P_{22}$ provides
a small positive contribution, but $|P_{13}| > P_{22}$, so the linear power is reduced
from $k_1$ to $k_2$ as power is transferred to smaller scales (and, minimally, to
larger scales). From $k_2$ to $2k_2 = 4k_1$, the power comes from harmonics generated
by $P_{22}$.
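The coefficient of the $k^4$ tail can be checked by hand. The sketch below assumes the second order Zel'dovich kernel is $G_2(\mathbf{k}_1, \mathbf{k}_2) = \frac{1}{2} (\mathbf{k}\cdot\mathbf{k}_1)(\mathbf{k}\cdot\mathbf{k}_2)/(k_1^2 k_2^2)$ with $\mathbf{k} = \mathbf{k}_1 + \mathbf{k}_2$, that $P_{22} = 2\int d^3k'\, G_2^2\, P_0 P_0$, and that the hat spectrum is normalized by $\int d^3k\, P_0 = 1$; $\mu$ denotes the cosine of the angle between $\mathbf{k}$ and $\mathbf{k}'$, and edge effects of the hat are neglected at leading order in $k$.

```latex
% Small-k limit of the kernel (k << k'):
G_2(\mathbf{k}', \mathbf{k}-\mathbf{k}')
  \simeq -\frac{1}{2}\,\frac{(\mathbf{k}\cdot\mathbf{k}')^2}{k'^4}
  = -\frac{k^2 \mu^2}{2 k'^2}
% Insert into P_22 and integrate over angles:
P_{22} \simeq 2 \int d^3k'\, \frac{k^4 \mu^4}{4 k'^4}\, P_0^2(k')
  = \frac{k^4}{2}\, 2\pi \int_{-1}^{1} \mu^4\, d\mu \int dk'\, \frac{P_0^2(k')}{k'^2}
  = \frac{2\pi}{5}\, k^4 \int dk'\, \frac{P_0^2(k')}{k'^2}
% Hat spectrum P_0 = C hat(k_1, 2 k_1):
\int_{k_1}^{2k_1} \frac{C^2\, dk'}{k'^2} = \frac{C^2}{2 k_1}
\quad\Longrightarrow\quad
P_{22} \to \frac{\pi C^2 k^4}{5 k_1}
% Normalization: C (4\pi/3)(8 - 1) k_1^3 = 1, so C = 3 / (28 \pi k_1^3):
P_{22} \to \frac{\pi}{5 k_1}\, \frac{9}{784\, \pi^2 k_1^6}\, k^4
  = \frac{9\, k^4}{3920\, \pi\, k_1^7}
```

Under these assumptions the factor 3920 arises as $5 \times 28^2$.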


Figure 5-15. Second order contributions to $P(k)$. The linear spectrum (thick solid
line) is $b^2 P_0 \sim \mathrm{hat}(1, 2)$ for $\sigma_l = 0.1$. The smaller nonlinear
contributions are $b^4 P_{22}$ (solid curve for $0 < k < 4$) and $b^4 P_{13}$ (dashed
curve for $1 < k < 2$; we plot $|P_{13}|$ since it is negative), and we show the
asymptotic $k^4$ tail (dot-dash line for small $k$).


Figure 5-16. Mode coupling and $k^4$ tail for the second order power spectrum. Based
on an initial spectrum $P_0 \sim \mathrm{hat}(10, 20)$ and perturbation theory up to
third order. $P(k)$ is plotted for $\sigma_l = 0.125$, 0.25, 0.5, 1.0, 2.0, and 4.0
(bottom to top). The asymptotic $k^4$ tail (dashed line) is plotted in the
$\sigma_l = 1.0$ case. Perturbation theory breaks down for $\sigma_l > 1$ (dotted
curves).


Figure 5-18. Measured power for initial $P_0 \sim \mathrm{hat}(6, 12)$. The six epochs
are specified by $\sigma_l = (1.5)^i$, $i = -3, \ldots, 2$ (spanning from
$\sigma_l \approx 0.3$ to 2.25, from bottom to top). Perturbation theory has broken
down for the dotted curves ($\sigma_l > 1$). The asymptotic $k^4$ tail (dashed line)
is plotted for $\sigma_l = 1$.


Recall that $k$ has already been scaled by $2\pi/L$ to account for the integer
wavevectors used on the simulation grid. For our simulations, $k_1 > 1$, but $k_1$ can
not be too large, since we must account for the dynamic range: $128^3$ grid,
$k_{nyq} = 64$, single precision code (real numbers are four bytes). To illustrate the
scaling with time, the perturbation theory calculation is shown in Figure 5-16 for the
case $k_1 = 10$, with evolution specified by $\sigma_l = 0.125$, 0.25, 0.5, 1.0, 2.0,
and 4.0. For $\sigma_l \approx 1.47$, the $P_{13}$ term begins to overcome the linear
contribution and perturbation theory breaks down. The dotted curves ($\sigma_l = 2$
and 4) in Figure 5-16 are therefore truncated. Measured power in the perturbative
regime is shown in Figure 5-17, where $\sigma_l = 0.5$ for an initial
$\mathrm{hat}(10, 20)$ spectrum. The spectrum shown was measured from one realization,
rather than averaging over several to reduce noise. This $\sigma_l = 0.5$ run is just
early enough that the $2k_1 < k < 4k_1$ range is well covered by the 1-loop mode
coupling. Later on, 2-loop effects ($P_{24}$) decrease $P(k)$ in the
$2k_1 < k < 4k_1$ range while providing ($P_{33}$) leading order harmonics for
$4k_1 < k < 6k_1$ (Fry 1994a). For small $k$ the signal is modified by aliasing:
spurious high $k$ (above Nyquist) to low $k$ transfer due to the periodicity of the
FFT series. To better observe the $k^4$ tail and the $b^2$ and $b^4$ scaling in a
simulation, we choose $k_1 = 6$, to allow grid effects to be washed out even for the
early ($\sigma_l \lesssim 1$) epochs. The measured power spectrum and the perturbative
result are shown in Figure 5-18 for the $\mathrm{hat}(6, 12)$ case, at the six times
$\sigma_l = 0.296$, 0.444, 0.667, 1, 1.5, and 2.25 (which differ by a constant factor
of 1.5). The signal has been cleaned up by averaging over 20 realizations of the
initial Gaussian field. The aliasing of the FFT cannot be eliminated here. The
small-scale signal for $k > k_{nyq} = 64$ is folded back into the $1 < k < 64$ range.
This effect is apparent for $k \approx k_{nyq}$ (note that the signal levels off or
even turns upward). Also, the aliasing is apparent at small $k$, since the magnitude
of the perturbative $k^4$ part is comparable to or smaller than the measured signal
leveling off at the Nyquist frequency (note that for $k < 3$ this is true in all the
curves of Figure 5-18).

We performed high resolution one-dimensional simulations to check whether higher
order perturbative effects can be detected in the power spectrum. Using a
$2^{18} = 262144$ grid size and a $\mathrm{hat}(10, 20)$ initial spectrum, the power
spectrum was measured by averaging $|\delta(k)|^2$ for 100 realizations. For six
times, $\sigma_l = 2^i$, $i = -5, -4, \ldots, 0$ (from $\sigma_l = 0.03125$ to 1), the
measured power is plotted in Figure 5-19. The second order power spectrum (the 1-d
version), which includes the $k^4$ tail and harmonics up to $k = 40$, is also shown at
each epoch. Note that the second order spectrum (dotted curve) for the $\sigma_l = 1$
case is truncated. Higher order mode coupling is clearly evident. The
$P \sim b^{2(n+1)}$ scaling for $n$-loop corrections is illustrated in Figure 5-20,
which uses the same data as Figure 5-19 but plotted at several fixed $k$ values to
show the scaling with time. We have $k_1 = 10$ and $k_2 = 20$, and for $k > 20$ we
choose $k$ values of 25, 45, and 65 to test the 1-, 2-, and 3-loop scaling, since
$P_{mm}$ provides the leading contribution from $(m - 1)k_2$ to $m k_2$ (Fry 1994a).

The $P_{mn}$ contributions to $P(k)$ may be integrated to obtain the local variance

    $\sigma^2 = \langle \delta^2 \rangle = \int d^3k\, P(k)$.    (5.23)

For the case $P_0 \sim k^n$ with a sharp cutoff at $k_c$, Scoccimarro and Frieman
(1996) have obtained the higher order corrections up to the 3-loop level (including up
to seventh order in the perturbation series (5.16)). Scoccimarro and Frieman (1996)
find that, independent of $n$,

    $\sigma^2 = \sigma_l^2 + 1.267\, \sigma_l^4 + 2.022\, \sigma_l^6 + 3.939\, \sigma_l^8 + O(\sigma_l^{10})$.    (5.24)

We  find  that  while  t^o/dft2  £ 0,  this  expression  provides  a good  approximation  to 
the  measured  values  (provided  that  normalization  integrals  are  computed  using 


Figure 5-19. Mode coupling for 1-d $\mathrm{hat}(10, 20)$ simulation. Measured power,
averaged over 100 simulations, is shown at six epochs, $\sigma_l = 2^i$,
$i = -5, \ldots, 0$ (from $\approx 0.03$ to 1). The second order power spectra are
plotted as the solid curves. For $\sigma_l = 1$ (dotted curve), perturbation theory
breaks down. Higher order perturbative mode coupling


Figure 5-20. Perturbative scaling in the 1-d power spectrum. From the data of
Figure 5-19, we show power versus time $\sigma_l \sim b(t)$ at several fixed
wavenumbers. The linear order (tree level) spectrum scales according to
$P \sim b^2$, while the $n$-loop contributions scale as $b^{2(n+1)}$. The lines of
slope 2, 4, 6, and 8 indicate these growth


Figure 5-21. Local fluctuations to 3-loop level for $P_0 \sim k^n$. The solid curves
show measured rms fluctuations for $n = -3$ (top curve), $-2$, $-1$, 0, and 1 (bottom
curve) for an initial power law spectrum with $k_c = 10$. The dashed line shows the
first order result. The dotted curve shows the perturbative expansion for $\sigma$
including 3-loop

to account for the discrete simulation grid). In Figure 5-21 we plot the (5.24) result
along with measured values. An initial $k^n$ spectrum with $n = -3$, $-2$, $-1$, 0, 1
and $k_c = 10$ was used.
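For reference, the truncated series quoted in (5.24) is easy to evaluate numerically. The sketch below simply sums the four quoted terms; the function name is hypothetical, and nothing beyond the quoted coefficients is assumed.

```python
def sigma2_three_loop(sigma_l):
    """Variance from the series in (5.24), truncated after the 3-loop term."""
    s2 = sigma_l ** 2
    # sigma_l^2 + 1.267 sigma_l^4 + 2.022 sigma_l^6 + 3.939 sigma_l^8
    return s2 + 1.267 * s2**2 + 2.022 * s2**3 + 3.939 * s2**4
```

At $\sigma_l = 0.5$ the last term already contributes about 4% of the total, which gives a rough sense of where the truncation starts to matter.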

5.2.3 Reduced Bispectrum

We have measured 3-point correlations under a variety of conditions. In particular,
the bispectrum $B(k_1, k_2, k_3)$ has been measured for underlying matter
distributions and discrete samples. The three degrees of freedom associated with the
closed $k$-space triangle need to be conveniently parametrized. One natural choice
(Fry, Melott, & Shandarin 1993) is to fix the magnitudes of the $\mathbf{k}_1$ and
$\mathbf{k}_2$ vectors, where we take $k_1 > k_2$. Then, since
$\sum \mathbf{k}_i = 0$, the third vector has a magnitude which spans the range
$k_1 - k_2$ to $k_1 + k_2$, and we parametrize the configuration by $\theta$, the
angle between $\mathbf{k}_1$ and $\mathbf{k}_2$. The $k_1 \pm k_2$ range in length
scales allows us to gauge nonlinearity by setting $k_1$ with respect to $k_{nl}$,
while the $\theta$ dependence reflects the anisotropic nature of gravitational
instability.

Second order perturbation theory provides a useful prediction for the bispectrum
using the $G_2$ kernel in (5.16),

    $B(k_1, k_2, k_3) = Q_{12} P_1 P_2 + Q_{23} P_2 P_3 + Q_{31} P_3 P_1$

    $Q_{ij} = 1 + \hat{\mathbf{k}}_i \cdot \hat{\mathbf{k}}_j \left( \frac{k_i}{k_j} + \frac{k_j}{k_i} \right) + (\hat{\mathbf{k}}_i \cdot \hat{\mathbf{k}}_j)^2$.    (5.25)

This is the perturbative result to first nonvanishing order, i.e. to tree level (Fry
1984). A systematic development of 3-point correlation results, including 1-loop
corrections, is provided by Scoccimarro (1997).

Inspired by galaxy survey measurements (Peebles & Groth 1975, Fry & Seldner 1982)
which show hierarchical 3-point correlations and by theory, the reduced
bispectrum Q is defined as

    Q = \frac{B(k_1, k_2, k_3)}{P_1 P_2 + P_2 P_3 + P_3 P_1}.   (5.26)


With this definition, much of the B dependence on scale, time, and initial spectrum is
eliminated, leaving a function with values of order unity. For example, with equilateral
triangles (\hat{k}_i \cdot \hat{k}_j = -1/2) the reduced bispectrum (5.26) is a constant 4/7. For the reasons
mentioned above, we utilize anisotropic triangle configurations parametrized by the
angle θ between k_1 and k_2. We fix k_1 = 2k_2, so the range in k goes from (1/2)k_1 to (3/2)k_1.
For scale-free initial spectra, P_0 ~ k^n with no cutoff (or with (3/2)k_1 below the filtering
scale) equation (5.26) gives a bispectrum with a significant shape dependence, as
shown in Figure 5-22.
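The shape dependence of the tree-level result can be explored with a short Python sketch. It assumes the standard Einstein-de Sitter second-order coefficients 10/7 and 4/7 in the kernel, a pure power law P ~ k^n with no cutoff, and the hierarchical normalization Q = B / (P_1 P_2 + P_2 P_3 + P_3 P_1). The names `kernel` and `reduced_bispectrum` are illustrative, not the author's code.

```python
import math


def kernel(ki, kj, mu):
    """Symmetrized second-order kernel; mu is the cosine of the angle between k_i and k_j."""
    return 10.0 / 7.0 + mu * (ki / kj + kj / ki) + (4.0 / 7.0) * mu * mu


def reduced_bispectrum(k1, k2, theta, n):
    """Tree-level Q(theta) for P ~ k^n; needs a non-degenerate triangle (k3 > 0)."""
    k3 = math.sqrt(k1 * k1 + k2 * k2 + 2.0 * k1 * k2 * math.cos(theta))
    # Cosines between the pairs of wavevectors, from k1 + k2 + k3 = 0.
    mu12 = math.cos(theta)
    mu23 = (k1 * k1 - k2 * k2 - k3 * k3) / (2.0 * k2 * k3)
    mu31 = (k2 * k2 - k3 * k3 - k1 * k1) / (2.0 * k3 * k1)
    p1, p2, p3 = k1 ** n, k2 ** n, k3 ** n
    b = (kernel(k1, k2, mu12) * p1 * p2
         + kernel(k2, k3, mu23) * p2 * p3
         + kernel(k3, k1, mu31) * p3 * p1)
    return b / (p1 * p2 + p2 * p3 + p3 * p1)
```

For equilateral triangles (k_1 = k_2, θ = 2π/3) the function returns 4/7 for any n, which is a convenient sanity check. For k_1 = 2k_2 and n = -1 the collinear value at θ = 0 exceeds the value at θ = π/2.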

Although we specify k_2/k_1 = 0.5, it is possible to choose other values for the
bispectrum triangle configuration. At the least, a clean P(k) signal at k_1, k_2 and for
the k_1 - k_2 < k_3 < k_1 + k_2 range would be needed. If the 2-point function is not clear,
one would not expect the 3-point function to yield much that is useful. In terms of
the dynamic range of our simulations, k_2/k_1 = 0.5 provides a good compromise. If
one chooses k_2/k_1 ≪ 0.5, then essentially two scales are being probed, k_1 ≈ k_3 and
large scales k_2, both of which are likely not to be within range of measurement in the
same data set. Choosing k_2 ≈ k_1 gives triangles which span scales 0 < k < 2k_1, a
viable possibility only if signals are clear in the entire range. In Figure 5-23 we show
the Q(θ) perturbative predictions for various k_2/k_1 ratios, for P_0 ~ k^{-1}.

From a signal detection point of view, the more shape dependence the better. In
the nonlinear regime, we find that the Q(θ) curve tends to flatten out. We observe
a similar flattening when bias is present. In trying to ascertain the effects of
nonlinearity or bias, we use the tree level perturbative result Q_pt(θ), equation (5.25), as
our reference. As for obtaining more or less configuration dependence, it turns out
that the k_2/k_1 choice is not necessarily crucial, which can be seen from Figure 5-23,
though we find k_1 = 2k_2 the most practical choice. Although a scale-free k^n
spectrum is an idealization of sorts, it is quite possible to find data for which effectively


Figure 5-22. Reduced bispectrum Q(θ) in perturbation theory for scale-free power,
P ~ k^n with no cutoff. The Fourier space triangles are defined by k_1 = 2k_2 and θ is
the angle between k_1 and k_2. From top to bottom, the curves are for n = -2, -1, 0,


Figure 5-23. Perturbative Q(θ) for various k_2/k_1 values. For triangle configurations
specified by a fixed k_2/k_1 ratio (θ is the angle between k_1 and k_2) the reduced
bispectrum is plotted for P ~ k^{-1}.


Figure 5-24. Perturbative Q(θ), with k_1 = 2k_2 and power filter k_f ≈ k_1. The power
spectrum is P ~ k^{-1} up to the filtering scale, after which it is cut off or approximated

P ~ k^n over the length scales studied. If k_1 < k_f but (3/2)k_1 falls beyond k_f, then
equation (5.25) could still be applied to get nonzero Q predictions, but this is not
useful. The measured power can be used in the Q(θ) equation (5.25), or one can
use an approximate model. A useful estimate takes the linear power for k < k_f and
joins this with an approximate P ~ k^{-3} beyond, since we found that our k_nl filtered
simulations typically had such a high k tail. In Figure 5-24 we show the bispectrum
curve for P ~ k^{-1} with various high k behaviors when k_f ≈ k_1. Along with the
scale-free case, we show a sharp cutoff at k_1, a Gaussian cutoff at 1.25 k_1, and a sharp
transition to k^{-3} at k_1.
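The high-k behaviors compared here can be written as a small model function. The sketch below is one possible reading: the Gaussian filter is assumed to have the form k^n exp(-k^2/k_f^2), and the k^{-3} tail is matched continuously to the power law at k_f. Both choices, and the name `model_power`, are assumptions made for illustration.

```python
import math


def model_power(k, n, kf, tail="k3"):
    """Power-law spectrum k^n with a chosen behavior above the filter scale kf.

    tail = "none":     scale-free k^n at all k
    tail = "sharp":    zero power for k > kf
    tail = "gaussian": k^n * exp(-(k/kf)^2)   (assumed filter shape)
    tail = "k3":       continuous switch to k^-3 for k > kf
    """
    if tail == "none":
        return k ** n
    if tail == "sharp":
        return k ** n if k <= kf else 0.0
    if tail == "gaussian":
        return k ** n * math.exp(-(k / kf) ** 2)
    if tail == "k3":
        return k ** n if k <= kf else kf ** n * (k / kf) ** -3
    raise ValueError("unknown tail: " + tail)
```

Such a function can stand in for the measured power when evaluating the perturbative Q(θ) for triangles whose third side reaches beyond the filtering scale.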

In an effort to have a strong signal, we let k_1 coincide with k_nl, which also serves
as the filtering scale. It is possible, and even desirable, to choose k_1 < k_nl. But if k_1
is too small, two difficulties may arise. The bispectrum signal may become
indistinguishable from the noise, since the bispectrum is small in the linear regime. Also, for
small k_1 values, there are fewer triangle configurations in k-space to average over. Fry
et al. (1993) estimate this statistical effect on the reduced bispectrum error bars ΔQ
in three dimensions to roughly follow ΔQ ~ (k_nl/k_1)^{3+n}, qualitatively borne out in
our data. We choose four epochs, k_nl = 32, 16, 8, and 4, in a series of P_0 ~ k^n
simulations with n = 1, 0, -1, and -2. Results are shown in Figures 5-25 through 5-28,
in the case of sharp k_c = k_nl filtering. Analogous Gaussian filtering results are shown
in Figures 5-29 through 5-32. The Q error bars shown are the statistical errors
associated with averaging over k-space configurations. Even at this stage of nonlinearity,
the Q signal shows the characteristic shape expected from perturbation theory, and
furthermore the curves are more flattened as n increases. For completeness, we show
the data for each of the four different cutoffs. Were it not for the k-space resolution
differences, one would expect the signals to be virtually identical (for the same n),
since they are characterized by k_nl = k_1 in each case. Figure 5-33 shows the
bispectrum data for the four different cutoffs in the P_0 ~ k^{-1} case. The perturbative Q(θ)
curve is plotted using P ~ k^{-3} for k > k_c. Perhaps surprisingly, this accounts for
the overall behavior of the data, though the measured curve is more flattened.

The clustering signal in the reduced bispectrum, which we have called Q(θ), is
obtained from the raw signal by performing discreteness and sharpening corrections.
The raw bispectrum is ⟨δ(k_1)δ(k_2)δ(k_3)⟩ averaged over closed triangle (Σ k_i = 0)
configurations. Taking into account shot noise, this is related to the clustering 3-point
signal B(k_1, k_2, k_3) by

    \langle \delta(k_1) \delta(k_2) \delta(k_3) \rangle = B(k_1, k_2, k_3) + \frac{1}{N} \left[ P(k_1) + P(k_2) + P(k_3) \right] + \frac{1}{N^2}.   (5.27)

This situation is analogous to the 2-point case, equation (5.3). Let k_1 and k_2 be
nonzero, with k_1 > k_2. Then to close the triangle we have

    k_3^2 = k_1^2 + k_2^2 + 2 k_1 k_2 \cos\theta.

The nonzero restrictions simply remove k_i = 0 delta function terms (not applicable
to the measurements anyway) from the expression below. Directly from the discrete
grid counts array m(x), we calculate the third moment to be

    \langle \delta(k_1) \delta(k_2) \delta(k_3) \rangle = \ldots \, \zeta(r, r') \, e^{-i (k_1 \cdot r + k_2 \cdot r')} \, \ldots

where ζ is the spatial 3-point function. The two discreteness terms involving 1/N (or
1/N_par) reproduce the 1/N terms of (5.27), in the appropriate limits. As in the power
spectrum case, we find that discreteness corrections (5.27) are reasonably effective in
separating out the clustering signal. Isotropic sharpening corrections are applied after
the shot noise subtractions.
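The discreteness correction amounts to inverting the shot noise relation for B. A minimal sketch follows, assuming the power and bispectrum are normalized so that the Poisson terms are exactly 1/N and 1/N^2 for N galaxies; the function names are hypothetical, and the sharpening correction is not included.

```python
def clustering_power(raw_power, n_gal):
    """Remove the 1/N Poisson term from a raw power estimate."""
    return raw_power - 1.0 / n_gal


def clustering_bispectrum(raw_bispectrum, p1, p2, p3, n_gal):
    """Remove the shot noise terms from a raw bispectrum estimate.

    p1, p2, p3 are the clustering (already shot-noise-corrected) power values
    at the three sides of the triangle.
    """
    return raw_bispectrum - (p1 + p2 + p3) / n_gal - 1.0 / n_gal ** 2
```

Note that the power must be corrected first, since the corrected P values enter the bispectrum subtraction.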


Figure 5-25. Measured reduced bispectrum Q(θ) for P_0 ~ k^n with sharp cutoff
k_c = k_nl = 32 and triangles with k_1 = 2k_2 = 32. The perturbative results, Q_pt using
the measured power, are shown as the dotted curves.


Figure 5-26. Measured reduced bispectrum Q(θ) for P_0 ~ k^n, k_c = k_nl = 16 and
k_1 = 2k_2 = 16. Dotted curves: Q_pt using the measured power.


Figure 5-27. Measured reduced bispectrum Q(θ) for P_0 ~ k^n, k_c = k_nl = 8 and
k_1 = 2k_2 = 8. Dotted curves: Q_pt using the measured power.


Figure 5-28. Measured reduced bispectrum Q(θ) for P_0 ~ k^n, k_c = k_nl = 4 and
k_1 = 2k_2 = 4. Dotted curves: Q_pt using the measured power.


Figure 5-29. Measured reduced bispectrum Q(θ) for P_0 ~ k^n with Gaussian cutoff
k_G ≈ k_nl = 32 and triangles with k_1 = 2k_2 = 32. The perturbative results, Q_pt using
the measured power, are shown as the dotted curves.


Figure 5-30. Measured reduced bispectrum Q(θ) for P_0 ~ k^n, k_G ≈ k_nl = 16 and
k_1 = 2k_2 = 16. Dotted curves: Q_pt using the measured power.


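Figure 5-31. Measured reduced bispectrum Q(θ) for P_0 ~ k^n, k_G ≈ k_nl = 8 and
k_1 = 2k_2 = 8. Dotted curves: Q_pt using the measured power.


Figure 5-32. Measured reduced bispectrum Q(θ) for P_0 ~ k^n, k_G ≈ k_nl = 4 and
k_1 = 2k_2 = 4. Dotted curves: Q_pt using the measured power.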



5.3  Nonlinearity  and  Bias 

The measured power generally follows the linear theory predictions for k < k_nl.
Results from §5.2.3 suggest that tree level perturbation theory is still providing
qualitatively correct bispectra predictions into the nonlinear regime. Linear theory predicts
Q = 0, but the skewness of the δ distribution increases with time (~ δ^{(1)} to lowest
order), and we find that a 3-point signal is measurable even in the quasi-linear regime.
For the 3-point function, second order perturbation theory is utilized to find the first
nonvanishing predictions. We may control the nonlinearity in our Q(θ) measurements
by varying k_1/k_nl. Choosing k_1 ≪ k_nl, though not possible in practice due to
limited dynamic range, would guarantee compliance with perturbation theory. Since
k_1 = 2k_2, choosing (3/2)k_1 ≲ k_nl should perhaps prove sufficient for some second order
perturbative results to play a useful role in data analysis involving mild nonlinearities
and bias.

Since the initial spectra are filtered for k ≳ k_f, the scale-free perturbative results
for Q(θ) are modified if (3/2)k_1 ≳ k_f, and we may account for this (e.g. by adding a
k^{-3} tail to P(k) or by simply using the measured power in the perturbative Q, as
is necessary in astronomical data). Although the tree level perturbative bispectrum
expression is fixed for P_0 ~ k^n and (3/2)k_1 ≲ k_f, the Q(θ) signal would be expected
to change due to higher order nonlinear effects and due to the dynamic range of the
Q estimator. We are able to observe higher order effects which depend on the power
index n and on k_1/k_nl.

The correlation functions should also be modified by biasing. We use a local
model of biasing,

    \delta_b = \sum_{j} \frac{b_j}{j!} \delta^j.   (5.30)

The power spectrum ⟨|δ_b(k)|^2⟩ differs from the underlying ⟨|δ(k)|^2⟩ according to linear
bias,

    P_b(k) = b_1^2 P(k).   (5.31)

If linear bias applied in general, then the 3-point function would be expected to change
by the same b_1 factor, B_b = b_1^3 B, so the reduced bispectrum would be

    Q_b = \frac{Q}{b_1}.   (5.32)

Consistent to second order in perturbation theory and nonlinear local bias, the 3-point
function (Fry & Gaztañaga 1993) is

    Q_b = \frac{Q}{b_1} + \frac{b_2}{b_1^2}.   (5.33)

We are able to observe the effects of biasing in P(k). More importantly, since there
... (Fry 1994b), we also detect the effects in Q(θ). While
nonlinearities unrelated to bias do affect the Q(θ) signal, we find that the biasing
effects are still appreciable.

Regarding potential observations, note that Q(θ) is largely Ω_0 independent, due
to the nearly exact lack of Ω_0 dependence in the second order perturbative
expressions (Bouchet et al. 1992), and it depends on shape rather than normalization.
The Hubble constant h and density parameter Ω_0 (which come into play for P(k)
normalization) are not directly in the equations. As for the power, note that cosmic
background radiation ΔT/T fluctuations shed light on the largest scales, where
P(k) ~ k. If uncertainties in the background fluctuation data are low, then (within
model dependent uncertainties) the data are really probing ancient, unbiased matter
fluctuations. With a large scale P(k) in hand (up to a normalization including an
h^2 Ω_0 factor), one may try to match this with the galaxy survey power spectra on
scales below 100 h^{-1} Mpc, but any determination of b_1 is always tied in with Ω_0 and
can be quite model dependent. As pointed out by Fry (1994b) regarding survey data
and by Fry et al. (1995) regarding N-body simulation data, the Q(θ) statistic is a
promising candidate for probing bias.
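The step from a local bias expansion to the shifted and rescaled reduced bispectrum can be spelled out at leading order. The following is a sketch of that reasoning under stated assumptions (Gaussian initial conditions, the expansion truncated at second order, the zero-mode term dropped), not a reproduction of the original derivation.

```latex
% Local bias truncated at second order (constant term dropped):
\delta_b = b_1\,\delta + \tfrac{1}{2}\,b_2\,\delta^2 + \dots
% Leading-order power spectrum:
P_b(k) = b_1^2\,P(k)
% Leading-order bispectrum: the b_1^3 term carries the gravitational
% signal, and the b_1^2 b_2 term arises from Wick contractions of the
% quadratic piece with two linear fields:
B_b = b_1^3\,B + b_1^2\,b_2\,\bigl(P_1 P_2 + P_2 P_3 + P_3 P_1\bigr)
% Dividing by the biased hierarchical normalization
% P_{b,1}P_{b,2} + \text{cyc.} = b_1^4\,(P_1 P_2 + P_2 P_3 + P_3 P_1):
Q_b = \frac{Q}{b_1} + \frac{b_2}{b_1^2}
```

The first term reduces the amplitude of the configuration dependence by 1/b_1, while the second adds a configuration-independent offset. This is why bias appears as both a flattening and a shift of the Q(θ) curve.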

In Figures 5-34 through 5-38 we control n and k_1/k_nl in a series of 3-point measurements.
The persistent success of the tree level Q(θ) result, equation (5.25), is demonstrated.
We find the bispectrum signal is flattened by higher order effects for n ≳ -1.5. For
n ≲ -1.5 the configuration dependence is actually more pronounced. The robustness
of the perturbative result is not closely tied with use of scale-free power spectra, as
we have utilized other initial spectra as well. Furthermore, the filtering we must
necessarily perform on the k^n spectra destroys the scale invariance anyway. Each
plot shows a dotted curve, calculated from (5.25) using the measured power for that
simulation run. For reference, the k_1/k_nl = 0.625 case also shows the prediction using
the k_G filtered P_0 as the dashed curve. These runs use P_0 ~ k^n, k_G ≈ k_nl = 32,
and we take measurements from one realization. For k_1/k_nl ≲ 0.5, the Q signal
generally becomes too noisy to be useful. This is illustrated in the n = 0 and n = 1
cases, Figures 5-37 and 5-38. After evolving the simulations to k_nl = 16, we show in
Figure 5-39 the k_1/k_nl = 0.625 results, for a sharp k_c = 16. The results are similar
to those above, though we averaged over 10 realizations to achieve a clean signal.

The question naturally arises as to how biasing in the data set will affect the
correlations. Additionally, it is important to know whether nonlinearities unrelated
to bias will be so entangled as to make it difficult or impractical to discern the bias
unambiguously. We find that the biased power obeys the expected trend: more bias
leads to greater measured power. When the power signal is quite clear, we generally
find that there are regions (typically in the linear regime) where the biased power
is well fit by the linear relation. At times, a good fit for a constant P_b/P
does not exist, either because the signal is too noisy or simply the P_b and P appear


Figure 5-34. Measured reduced bispectrum at various scales, for P_0 ~ k^{-2}, k_G ≈
k_nl = 32. Dotted curves: Q_pt using the measured power. Dashed curve: Q_pt using the
filtered P_0.


Figure 5-35. Measured reduced bispectrum at various scales, for P_0 ~ k^{-1.5}, k_G ≈
k_nl = 32. Dotted curves: Q_pt using the measured power. Dashed curve: Q_pt using the
filtered P_0.


Figure 5-36. Measured reduced bispectrum at various scales, for P_0 ~ k^{-1}, k_G ≈
k_nl = 32. Dotted curves: Q_pt using the measured power. Dashed curve: Q_pt using the
filtered P_0.


Figure 5-37. Measured reduced bispectrum at various scales, for P_0 ~ k^0, k_G ≈
k_nl = 32. Dotted curves: Q_pt using the measured power. Dashed curve: Q_pt using the
filtered P_0.


Figure 5-38. Measured reduced bispectrum at various scales, for P_0 ~ k^1, k_G ≈
k_nl = 32. Dotted curves: Q_pt using the measured power. Dashed curve: Q_pt using the
filtered P_0.


Figure 5-39. Reduced bispectrum for k_1/k_nl = 0.625 and sharply cut off initial
power, averaged over 10 realizations. The Q_pt curves, using measured power, are also
shown.

to have different slopes. We find the bispectrum signal to flatten out with more
biasing. Typically, the flattening/shifting according to the nonlinear biasing effect,
equation (5.33), provides a useful biasing indicator into the nonlinear regime. We
present data in support of these claims below.

Two types of threshold biasing are applied. For sharp threshold bias, cells with
δ < δ_t are emptied, leaving only the higher density regions (so δ is clipped to -1 for
density contrasts below δ_t). For power law bias, the "galaxy" probability function is
𝒫 ~ (1 + δ)^α below a threshold δ_t. The latter prescription is slightly less drastic, and
perhaps more realistic, than the sharp threshold case. We include both for comparison
purposes. Especially with the sharp threshold we can achieve a significant bias, such
as δ_t = 5 or 10, even within the limited dynamic range of the simulations. The signal
degradation is such that we generally use δ_t ≲ 6.
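Both prescriptions are simple to express on a gridded density contrast. The NumPy sketch below is illustrative only: it assumes cells at or above the threshold are always kept, and it normalizes the power law probability to 1 at the threshold so that the selection is continuous there. Neither detail is stated in the text, and the function names are hypothetical.

```python
import numpy as np


def sharp_threshold(delta, delta_t):
    """Empty every cell below the threshold (contrast set to -1); keep the rest.

    The returned contrast is still relative to the original mean density.
    """
    delta = np.asarray(delta, dtype=float)
    return np.where(delta < delta_t, -1.0, delta)


def power_law_selection(delta, delta_t, alpha):
    """Selection probability ~ (1 + delta)^alpha below delta_t, and 1 otherwise.

    Assumed normalization: the probability reaches 1 at delta = delta_t.
    """
    delta = np.asarray(delta, dtype=float)
    prob = ((1.0 + delta) / (1.0 + delta_t)) ** alpha
    return np.where(delta < delta_t, prob, 1.0)
```

A biased sample would then follow from keeping each galaxy in a cell with the returned probability. Larger α suppresses low density cells more strongly, and the sharp threshold is the limit of very large α.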

Relatively clean biased power signals are shown in Figure 5-40, obtained from 10
realizations of P_0 ~ k^n, k_c = 16 simulations. For each initial spectrum, sharp and
ρ^{1/2} thresholds are shown, for δ_t = 3 and 6. Fitting to the b_1^2 P form for k < k_nl, we
show the linear bias parameter b_1 in Figure 5-41 as a function of δ_t. The Jensen and
Szalay (1986) result for clipped Gaussian fields, equation 4.39, is shown as the dotted
line (we use σ = 1 in the denominator). The functional relationship is steeper in the
Gaussian case, as might be expected since clustered data certainly have a σ > 1
non-Gaussian tail. There is not much δ_t dependence in the 𝒫 ~ ρ^{1/2} case, since for δ_t ≳ 1
one might expect more dependence on the exponent α. There is just enough evolution
at k_nl = 32 in our simulations to use k_G ≈ 32 filtered spectra and utilize the k < k_nl
signal. Figure 5-42 shows the biased power for one realization, again for two power
law and two sharp threshold biases (note the high k problems). Choosing k_nl = 16
generally allows enough evolution for utilizing several biases while still maintaining
a significant small k linear regime. Particularly clean biasing signals are shown in
Figure 5-43, which shows the power for 20 realizations of a P_0 ~ k^{-1.5}, k_c = 16
simulation. With a ρ^α threshold, δ_t = 4, we show the unbiased power (the lowest,
solid curve) as well as α = 0.1, 0.5, 1, and 2 biased samples (the dotted curves).
Additionally, a sharp δ_t = 10 sample is shown (dashed curve). For the δ_t = 10 case,
we also show the signal from just one realization (the sample had only N = 6000
galaxies). As one might expect, in the mildly nonlinear regime, averaging over many
realizations (analogous to using very large astronomical data sets) really does narrow
in on the signal. We plot the linear bias parameter b_1 versus the threshold power
index α in Figure 5-44.
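Fitting the biased power to a constant multiple of the unbiased power over the linear modes can be sketched as an unweighted least-squares problem. The text does not say how the fit was weighted, so the unit weights here are an assumption, and `linear_bias_from_power` is a hypothetical name.

```python
import numpy as np


def linear_bias_from_power(k, p_biased, p_unbiased, k_nl):
    """Unweighted least-squares b for P_b = b^2 P, using only modes with k < k_nl."""
    k = np.asarray(k, dtype=float)
    pb = np.asarray(p_biased, dtype=float)[k < k_nl]
    p = np.asarray(p_unbiased, dtype=float)[k < k_nl]
    # Minimizing sum (pb - b^2 p)^2 over b^2 gives b^2 = sum(pb p) / sum(p^2).
    b_squared = np.sum(pb * p) / np.sum(p * p)
    return float(np.sqrt(b_squared))
```

With measurement errors available, the sums would carry inverse-variance weights, and a χ^2 per degree of freedom could be formed to judge whether a constant ratio is an adequate description.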

We  now  turn  to  results  concerning  nonlinear  bias  and  the  bispectrum.  In  general, 
the  results  are  quite  encouraging  in  the  mildly  nonlinear  regime,  as  the  effects  of 
biasing  are  detectable  using  the  results  of  second  order  perturbation  theory. 

Using initial power indices of n = -2, -1, 0, and 1, we demonstrate nonlinear
biasing effects in Figures 5-45 through 5-52. We use k_nl = k_c = 16 simulations
and k_1 = 2k_2 = 10, so the bispectrum Q(θ) will be probing the quasi-linear regime.
Measurements were obtained from one realization of the initial conditions. Figure 5-45,
for n = -2, shows the unbiased reduced bispectrum (the solid curve connecting
the data points) and two bispectra for two different biasing schemes. The data points
from the 𝒫 ~ ρ^{1/2}, δ < 3 prescription are well fit by the dotted curve, the least
squares fit of the measured Q_b to the measured Q using the nonlinear biasing form
Q/b_1 + b_2/b_1^2. Similarly, the data points for the sharp δ_t = 3 prescription are closely
followed by the short-dashed curve. Additionally, we plot (long-dashed curve) the
same nonlinear bias fit using the Q_pt generated by P ~ k^{-2} rather than the measured
Q. This emphasizes that the nonlinear biasing effect is larger than the nonlinear effect
which slightly increases Q variation (over the tree-level Q_pt θ dependence) for n = -2.
Using the measured Q in fitting to the measured Q_b is utilizing more information than


Figure 5-40. Biased power spectrum measurements. For initial k^n, k_c = 16 spectra,
we show the unbiased power and the power from four biased samples, averaged over
10 realizations. In each panel, from bottom to top we have: unbiased (solid curve),
𝒫 ~ ρ^{1/2} up to δ_t = 3 and 6 (dotted curves), and sharp δ_t = 3 and 6 thresholds


Figure 5-41. Linear bias parameter b_1 from power spectrum measurements. For
initial k^n, k_c = 16 spectra, we plot b_1 versus the bias threshold δ_t. The circles apply
to a sharp threshold, the squares to a 𝒫 ~ ρ^{1/2} scheme below δ_t. The dotted line
shown in the n = 1 case is the analytic result for the clipping of a Gaussian field.


Figure 5-42. Biased power spectrum measurements. For initial k^n, k_G ≈ 32 spectra,
we show the unbiased power and the power from four biased samples, from
1 realization. In each panel, from bottom to top we have: unbiased (solid curve),
𝒫 ~ ρ^{1/2} up to δ_t = 3 and 6 (dotted curves), and sharp δ_t = 2 and 6 thresholds


Figure 5-43. Biased power spectrum for initial k^{-1.5}, k_c = 16 spectrum, averaged
over 20 realizations. The lowest (solid) curve is the unbiased spectrum. The dotted
curves correspond to power law (𝒫 ~ ρ^α) thresholds with δ_t = 4. From bottom to
top: α = 0.1, 0.5, 1, and 2. The short-dashed curve is the sharp clipping δ_t = 10 signal,
while the long-dashed line near it comes from one dilute realization.


Figure 5-44. Linear bias parameter b_1 from power measurements based on 20
realizations of initial k^{-1.5} spectra. For a power law threshold, 𝒫 ~ ρ^α, below δ_t = 4,
the index α is varied.


would be available in a galaxy survey. Even in ideal observational data, one would
only have a measured P and a measured Q. Given these spectra, one would ask
whether they were consistent with no biasing or whether they were really P_b and
Q_b with measurable bias parameters. We utilize the full information, as that is one of
the main benefits of a computer simulation approach (we do intentionally limit our
information below, when we demonstrate the viability of the bispectrum technique in a
dilute sample). To illustrate the biasing detection for various additional prescriptions,
we plot the Q/b_1 + c fitting parameters in Figure 5-46 as a function of the δ_t threshold.
The b_1 biasing parameter from Q_b is generally comparable to that obtained from P_b,
though typically we find that b_1 > b_P when n ≳ -1.5, where we use b_P to denote
the P_b = b_P^2 P fitting parameter. Furthermore, in the mildly nonlinear regime we
find that while b_1 ≈ b_P (accounting for the 1-σ error bars), the nonlinear biasing form
Q/b_1 + b_2/b_1^2 is consistently a much better fit than the naive Q/b_1. For example,
in the sharp δ_t = 3 biasing for n = -2, the power spectra gave b_P = 3.25 ± 0.01
with χ^2 per degree of freedom χ^2/15 = 123. The strict linear P_b = b_P^2 P fits for
k < k_nl are generally not good by this criterion. If one simply uses the ratio of P_b
to P over the modes sampled by the bispectrum, then the discrepancy between b_P
and b_1 still exists but is often smaller. The nonlinear biasing fit gives b_1 = 2.9 ± 0.14,
b_2/b_1^2 = 0.24 ± 0.02 with χ^2/8 = 0.34, while the Q/b_1 fit gives b_1 = 2.00 ± 0.03 with
χ^2/8 = 17. While in the mildly nonlinear regime, the Q/b_1 fit worsens relative to
the Q/b_1 + b_2/b_1^2 fit as the biasing is increased. Similar considerations hold for the
n = -1 case, Figures 5-47 and 5-48. For n = 0 and n = 1, Figures 5-49 through 5-52,
the signal has degraded yet bias is detected.
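Because the nonlinear bias form is linear in the unknowns 1/b_1 and c = b_2/b_1^2, the fit of a measured Q_b against a measured Q reduces to a weighted straight-line fit. The sketch below illustrates that idea with the closed-form normal equations; the binning, the error model, and the name `fit_nonlinear_bias` are assumptions rather than details taken from the text.

```python
import numpy as np


def fit_nonlinear_bias(q, q_b, sigma):
    """Weighted least-squares fit of q_b = q / b1 + c over the angular bins.

    Returns (b1, c, chi2_per_dof), where c plays the role of b2 / b1^2.
    """
    q = np.asarray(q, dtype=float)
    q_b = np.asarray(q_b, dtype=float)
    w = 1.0 / np.asarray(sigma, dtype=float) ** 2
    # Normal equations for the straight line y = a x + c, with a = 1 / b1.
    s, sx, sy = w.sum(), (w * q).sum(), (w * q_b).sum()
    sxx, sxy = (w * q * q).sum(), (w * q * q_b).sum()
    det = s * sxx - sx * sx
    a = (s * sxy - sx * sy) / det
    c = (sxx * sy - sx * sxy) / det
    chi2 = (w * (q_b - a * q - c) ** 2).sum()
    dof = q.size - 2  # two fitted parameters
    return 1.0 / a, c, chi2 / dof
```

Forcing c = 0 in the same setup gives the naive Q/b_1 fit, so the two χ^2 values can be compared directly. A nearly flat Q makes the determinant small, in which case b_1 and c become strongly degenerate.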

We demonstrate the utility of large data sets in Figure 5-53, where we average
over 20 realizations of an initial k^{-1.5}, k_nl = k_c = 16 spectrum. Two cleanly fit
biased bispectra are shown, for a 𝒫 ~ ρ^{1/2} prescription (dotted curve) and for a δ_t = 10


Figure 5-45. Biased bispectrum for initial power index n = -2. The unbiased signal
(solid curve) and the Q_b for two biasing prescriptions with threshold δ_t = 3 are shown.
Open squares: bias probability 𝒫 ~ ρ^{1/2}. Open circles: sharp threshold. Dotted and
dashed curves: best fits to nonlinear bias form.


Figure 5-46. Biasing parameters b_1 and c = b_2/b_1^2 for power index n = -2, as a
function of threshold δ_t.


Figure 5-47. Biased bispectrum for initial power index n = -1. The unbiased signal
(solid curve) and the Q_b for two biasing prescriptions with threshold δ_t = 3 are shown.
Open squares: bias probability 𝒫 ~ ρ^{1/2}. Open circles: sharp threshold. Dotted and
dashed curves: best fits to nonlinear bias form.


Figure 5-48. Biasing parameters b_1 and c = b_2/b_1^2 for power index n = -1, as a
function of threshold δ_t.


Figure 5-49. Biased bispectrum for initial power index n = 0. The unbiased signal
(solid curve) and the Q_b for two biasing prescriptions with threshold δ_t = 3 are shown.
Open squares: bias probability 𝒫 ~ ρ^{1/2}. Open circles: sharp threshold. Dotted and
dashed curves: best fits to nonlinear bias form.


Figure 5-50. Biasing parameters b_1 and c = b_2/b_1^2 for power index n = 0, as a
function of threshold δ_t.


Figure 5-51. Biased bispectrum for initial power index n = 1. The unbiased signal
(solid curve) and the Q_b for two biasing prescriptions with threshold δ_t = 3 are shown.
Open squares: bias probability 𝒫 ~ ρ^{1/2}. Open circles: sharp threshold. Dotted and
dashed curves: best fits to nonlinear bias form.


Figure 5-52. Biasing parameters b_1 and c = b_2/b_1^2 for power index n = 1, as a
function of threshold δ_t.

(dashed curve). The rather large threshold bias demonstrates how one may find a flat
bispectrum in the quasi-linear regime. The nonlinear biasing fit gives b_1 = 10 ± 1.1,
b_2/b_1^2 = 0.52 ± 0.01 (χ^2/8 = 0.41). Figure 5-54 shows the sensitivity of Q_b to the
biasing prescription, as we set α = 0.1, 0.3, 0.5, 0.7, 0.9, 1.0, 1.5, and 2.0 for a
𝒫 ~ ρ^α bias scheme, with biasing parameters displayed in Figure 5-55. This degree
of sensitivity is potentially useful in astronomical data sets, to the extent that one
can be certain of working in the quasi-linear regime.

If we let k_1 > k_nl, ostensibly probing the nonlinear regime with the bispectrum,
then a well defined 3-point signal is certainly expected since the matter distribution
is far from Gaussian. For k_1/k_nl values of 2, 3, 4, etc. in our simulations, we are
really probing the shell crossing or catastrophe/singularity sector, much as we were
in observing the approximate k^{-3} high k tail to the power spectrum. As such, this
area does not really mimic gravitational instability. The pancakes (and filaments)
would lead to the now familiar preference (enhanced correlation) for collinear triangle
configurations in k-space. We show this in Figure 5-56, a representative example with
k_1 = 24 = 3k_nl. Note that there is a range where Q is flat (similar, coincidentally, to
the flatness one expects, from astronomical observations and N-body simulations,
in the very highly nonlinear regime). The measured signal (solid curve in Figure 5-56)
is not closely tied with the perturbative result (dashed curve, obtained from the
measured power). The biased bispectrum, however, is still well fit to the Q/b_1 + c
form (dotted curve), giving b_1 = 4.2 ± 0.2, c = 0.13 ± 0.03.

We conclude this chapter with results pertaining to dilute samples. Using only
the measured power spectrum and bispectrum of a sparse sample, we demonstrate the
degradation of signal with decreasing N (the number of galaxies) and the detection of
bias. In Figures 5-57 and 5-58 we show power and bispectrum measurements utilizing
only the information from N ≈ 10^4 and N ≈ 10^3 data sets. We use P_0 ~ k^{-1.5},


Figure 5-53. Biased bispectrum for initial power index n = -1.5. The unbiased
signal (solid curve) and the Q_b for two biasing prescriptions are shown, averaged over
20 realizations. Open squares: bias probability 𝒫 ~ ρ^{1/2}, δ < δ_t. Open circles: sharp
threshold, δ_t = 10. Dotted and dashed curves: best fits to nonlinear bias form.


Figure 5-54. Biased bispectrum for initial power index n = -1.5. The unbiased
signal (solid curve) and the Q_b for eight biasing prescriptions, 𝒫 ~ ρ^α, δ < δ_t = 4,
are shown (from 20 realizations). The α values range from 0.1 (top dotted curve) to
2.0 (bottom dotted curve).


Figure 5-55. Biasing parameters b_1 and c = b_2/b_1^2 for power index n = -2, as a
function of index α, for 𝒫 ~ ρ^α, δ < δ_t.


Figure 5-56. Bispectrum in nonlinear regime, k_1 = 3k_nl. Solid curve: measured
unbiased Q. Dashed curve: perturbative unbiased Q. Dotted curve: best fit to
nonlinear bias form using measured Q.


k_nl = k_c = 16 simulations, k_1 = 10, and a sharp δ_t = 4 bias threshold. The full set
of information (averaged over 20 realizations) is displayed in Figure 5-59, with the
fitting parameters shown in Figure 5-60 as a function of N. The biased bispectrum
signal remains robust for N ≳ 4 × 10^3, as shown in Figure 5-61, for a mildly biased
sample probed at k ≲ k_nl.
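One way to produce sparse samples from a gridded catalogue is to thin the counts at random. The text does not describe how the dilute samples were drawn, so the binomial thinning below is purely an illustrative assumption, as are the function and parameter names.

```python
import numpy as np


def dilute(counts, n_target, rng):
    """Keep each galaxy independently with probability n_target / N_total.

    counts: integer array of galaxies per grid cell.
    rng: a numpy.random.Generator, e.g. np.random.default_rng(seed).
    """
    counts = np.asarray(counts)
    keep_prob = min(1.0, n_target / counts.sum())
    return rng.binomial(counts, keep_prob)
```

The thinned sample contains n_target galaxies only on average. Its shot noise terms, 1/N and 1/N^2 in the normalization used for the discreteness corrections, grow accordingly as N is reduced.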


N = 10.089  galaxies 


Figure  0-57.  Detecting  bias  for  N ss  101  galaxies.  Using  only  the  measured  (biased) 
power  spectrum  and  bispectrum,  the  nonlinear  bias  parameters  6|  = 3.1  ± 0.9  and 
bn/af  = 0.30  ± 0.08  are  detected. 


286 


N = 980 galaxies

Figure 5-58. Detecting bias for $N \approx 10^3$ galaxies. Using only the measured (biased)
power spectrum and bispectrum, the nonlinear bias parameters $b_1 = 2.1 \pm 1.0$ and
$b_2/b_1 = 0.1 \pm 0.2$ are detected.




Figure 5-59. Detecting bias under ideal conditions. The initial spectrum is $P_0 \sim k^{-1.5}$,
$k_{nl} = 16$, as in Figures 5-57 and 5-58, though the data have been averaged over
20 realizations (each containing $N \approx 10^5$ galaxies). Fitting the biased bispectrum
gives $b_1 = 4.1 \pm 0.2$ and $b_2/b_1 = 0.36 \pm 0.02$ using the measured $Q$ and $Q_b$.


Figure 5-60. Biasing parameters for sparse samples. The nonlinear bias parameters
$b_1$ and $b_2/b_1$ are measured as the number of galaxies $N$ is reduced.




Figure. 


CHAPTER  6 
FINAL  REMARKS 

Throughout this study, our goal has been to illustrate the nature of large-scale
structure in the universe by utilizing a perturbative approach to gravitational
instability. The applicability of a relatively simple expanding fluid model is somewhat of a
luxury, as the nonlinear nature of the large-scale structure problem could conceivably
have limited perturbation theory to a very minor role (though the smallness of the
measured $\Delta T/T \approx 10^{-6}$ microwave background fluctuations provides a solid
foundation for such an approach). Perturbation theory is not the whole answer, but on large
scales it may allow us to understand our part of the universe and how, or if, this
neighborhood fits within some as yet undetermined robust cosmological model.

The standard model of cosmology is a broad theory which utilizes various branches
of physics and which pertains to potential measurements on many scales, from globular
clusters and galactic halos to “fair samples” of the universe containing billions
of galaxies. Within cosmology, we have studied large-scale structure, a field which
generally involves galaxy formation, gravitational instability, the nature of dark and
luminous cosmic matter, and the description of the large-scale matter distribution.
These ingredients are interrelated. We have restricted our study to regions which are
large enough that gravity plays the dominant role but which fall within the subhorizon,
slow motion limit of general relativity. This is not a severe restriction, in that
a Newtonian approximation may apply from galaxy-sized regions to billion-light-year
regions which may comprise fair statistical samples of the universe. We applied the
expanding fluid model in the linear and mildly nonlinear regimes to attack a very
complex problem in a controlled fashion. Our approach, which utilized the efficient
Lagrangian mappings introduced by Zel'dovich (1970), is complementary to important
methods involving full N-body simulations. The perturbative approach to the
equations of motion has been expounded by Lifshitz (1946), Zel'dovich (1970), Peebles
(1980), Fry (1984), Goroff et al. (1986), Bertschinger (1992), and others. The
Zel'dovich approximation (Zel'dovich 1970, Shandarin & Zel'dovich 1989) is not the
same as the exact gravitational instability theory, though it has its own perturbation
theory (Grinstein & Wise 1987), similar in structure to the exact perturbative expansion
(but with simpler kernels), and this has allowed us to probe the development of
structure in an environment which shares many properties with the full dynamical

Structure can be described both statistically and visually. In Chapter 1, we
illustrated the visual aspects of large-scale structure in an elementary treatment of
the subject. There are basic aspects of the expanding background model which allow
a simplified approach to work on large scales. The metric expansion factor $a(t)$ grows,
to a good approximation, as a power law in $t$. The radiation to
matter domination transition occurs close to the time of decoupling, after which
cosmologically interesting structures are linearly unstable with approximate power
law growth $\delta(\mathbf{x}, t) = \delta_0(\mathbf{x})\, b(t) \sim a(t)$. The structures which arise may be analyzed
in various ways. Correlation statistics, pioneered by Peebles (1980) and others, have
proven useful in the past and still provide one of the main descriptive tools. These
ideas, which set the context for the present study, were addressed in Chapter 2.

The Lagrangian approach to perturbation theory, where material coordinate
displacements, rather than spatial fields, provide the perturbative parameters, was
described in Chapter 3. The important points of the Zel'dovich approximation for our
purposes were its pancaking properties and the common ground it shared with the
exact theory.
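
As a minimal illustration of the Lagrangian mapping and of pancaking, the sketch below applies the one-dimensional Zel'dovich map $x = q + b(t)\,\psi(q)$ to a single sinusoidal displacement mode. The displacement field $\psi(q) = -(A/k)\sin(kq)$, the amplitude `amp`, and all function names are assumptions chosen for illustration; they are not the initial conditions used in the simulations. Mass conservation gives $1 + \delta = 1/|dx/dq|$, so the density diverges (a pancake forms) where $dx/dq$ first reaches zero, here at $b A = 1$.

```python
import math


def zeldovich_positions(q_values, b, amp=1.0, k=1.0):
    """Eulerian positions x = q + b * psi(q), with psi(q) = -(amp / k) * sin(k q)."""
    return [q - b * (amp / k) * math.sin(k * q) for q in q_values]


def zeldovich_density(q, b, amp=1.0, k=1.0):
    """Density 1 + delta = 1 / |dx/dq| = 1 / |1 - b * amp * cos(k q)|."""
    jacobian = 1.0 - b * amp * math.cos(k * q)
    return 1.0 / abs(jacobian)


def is_single_stream(x_values):
    """True if particles keep their initial ordering (no shell crossing yet)."""
    return all(x2 > x1 for x1, x2 in zip(x_values, x_values[1:]))


if __name__ == "__main__":
    # Uniform Lagrangian grid over one period of the assumed mode.
    grid = [-math.pi + 2.0 * math.pi * i / 200 for i in range(201)]
    for b in (0.01, 0.5, 1.5):
        x = zeldovich_positions(grid, b)
        print(b, is_single_stream(x), zeldovich_density(0.3, b) - 1.0)
```

For small $b$ the density contrast reduces to the linear result $\delta \approx b A \cos(kq)$, and for $b A > 1$ the particle ordering is lost near $q = 0$, which is the shell crossing discussed in the text.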


Simulation methods were described in Chapter 4, as we investigated various initial
conditions and the onset of nonlinearity. The power spectrum $P(k)$, the spatial
2-point correlation function in Fourier space, characterized the initial Gaussian density
fluctuation field. Small scale filtering (the initial power $P_0(k)$ suppressed for
$k \gtrsim k_{nl}$) allowed for a hierarchy of pancaking (with coherence length $\sim 1/k_{nl}$ at each
epoch). This was illustrated in one, two, and three dimensions, though we later
emphasized three dimensions. Generally, our simulation results fall into two categories:
(a) tests for which we have an analytic result, with which we can verify
the normalization of our simulations and evaluate the dynamic range of our methods,
and (b) numerical explorations of the nonlinear or quasi-linear regimes. We showed
the agreement of the simulations with the linear theory of local density fluctuations,
$\sigma = \langle \delta^2 \rangle^{1/2} \approx \sigma_\ell \sim b(t)$ for $\sigma \ll 1$. Shell crossing in the Zel'dovich approximation
caused the fluctuations to stop growing at $\sigma_\ell \approx 1.5$, a robust result for $P_0 \sim k^n$
spectra with $1 > n > -3$. We found that $\sigma_\ell \approx 1$ was a good criterion for placing
the simulation in the mildly nonlinear regime, as it was late enough that nonlinearities
were important yet early enough that perturbation theory was still applicable.
Statistical applications in Chapter 5 emphasized the power $P(k)$ and the reduced
bispectrum $Q(\theta)$ (Peebles 1980, Fry & Seldner 1982, Fry 1994b), which is just now
being recognized as a practical and potentially important statistic. The bispectrum,
sensitive to the anisotropic nature of gravitational instability, is the first correlation
function devoted to non-Gaussian features and nonlinear evolution. We were able to
verify perturbative effects in power spectrum measurements, such as the first order
growth law $P \sim b^2(t)$ and the second order $P \sim b^4(t)$. For a top-hat initial spectrum,
we were able to directly observe the second and third order mode coupling in three
dimensions, though with limited dynamic range. To proceed further, we observed the
effect strikingly in one-dimensional simulations, measuring the $P \sim b^{2(n+1)}$ growth
for the $n$-loop perturbative corrections. The 1-loop $\sigma^2$ calculations of Scoccimarro
and Frieman (1996) were verified for $\sigma < 1$ in 3-d simulations. We find that at times
perturbative correlation results appear applicable surprisingly far into the nonlinear
regime, particularly for the bispectrum. The reduced bispectrum $Q(\theta)$ showed the
characteristic enhancements for collinear $\mathbf{k}_1$ and $\mathbf{k}_2$, which would be expected for
pancaking. More impressively, the second order perturbative result (Fry
1984) applied for scales with $k_1 < k_{nl}$, as might be expected, but it also applied
for $k_1 \gtrsim k_{nl}$, particularly if we used the practical technique of inserting the
measured $P(k)$ into the perturbative bispectrum expression. The predicted power index
$n$-dependence of $Q$ persisted in the mildly nonlinear regime. On top of this primary
effect, the configuration dependence deepened for $n \lesssim -1.5$ and the signal flattened
slightly for $n \gtrsim -1.5$. The nonlinear biasing effect (Fry & Gaztañaga 1993) proved
to be quite robust: bias was detected in the $Q$ signal under all conditions accessible to
our simulations.
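
The technique of inserting a power spectrum into the second order bispectrum expression can be sketched as follows. The code assumes the standard second order kernel for an Einstein-de Sitter background, $F_2 = 5/7 + \tfrac{1}{2}\mu\,(k_1/k_2 + k_2/k_1) + \tfrac{2}{7}\mu^2$, with $B = 2 F_2(\mathbf{k}_1, \mathbf{k}_2) P(k_1) P(k_2) + \mathrm{cyc.}$ and $Q = B / [P(k_1)P(k_2) + P(k_2)P(k_3) + P(k_3)P(k_1)]$. The convention that $\theta$ is the angle between $\mathbf{k}_1$ and $\mathbf{k}_2$ (so $\theta = 0$ is collinear), the function names, and the pure power-law `power_law` helper are assumptions made for illustration; any callable returning a measured $P(k)$ could be passed instead.

```python
import math


def f2_kernel(ka, kb, mu):
    """Symmetrized second order kernel; mu is the cosine between the two wavevectors."""
    return 5.0 / 7.0 + 0.5 * mu * (ka / kb + kb / ka) + (2.0 / 7.0) * mu ** 2


def reduced_bispectrum(k1, k2, theta, power):
    """Second order Q for a triangle k1 + k2 + k3 = 0.

    theta is the angle between k1 and k2 (assumed convention);
    power is any callable P(k), e.g. a measured spectrum.
    """
    mu12 = math.cos(theta)
    k3 = math.sqrt(k1 ** 2 + k2 ** 2 + 2.0 * k1 * k2 * mu12)
    if k3 == 0.0:
        raise ValueError("degenerate triangle: k3 = 0")
    # Cosines between k3 = -(k1 + k2) and the other two sides.
    mu13 = -(k1 + k2 * mu12) / k3
    mu23 = -(k2 + k1 * mu12) / k3

    p1, p2, p3 = power(k1), power(k2), power(k3)
    bispectrum = 2.0 * (f2_kernel(k1, k2, mu12) * p1 * p2
                        + f2_kernel(k1, k3, mu13) * p1 * p3
                        + f2_kernel(k2, k3, mu23) * p2 * p3)
    return bispectrum / (p1 * p2 + p2 * p3 + p3 * p1)


def power_law(n):
    """Scale-free spectrum P(k) = k**n (normalization cancels in Q)."""
    return lambda k: k ** n


if __name__ == "__main__":
    spectrum = power_law(-1.5)
    for theta in (0.0, math.pi / 3.0, 2.0 * math.pi / 3.0):
        print(theta, reduced_bispectrum(1.0, 1.0, theta, spectrum))
```

With these conventions an equilateral triangle ($k_1 = k_2$, $\theta = 2\pi/3$) gives $Q = 4/7$ for any power-law index, while the collinear configuration gives a larger value, in line with the enhancement described above.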


REFERENCES 



Goroff, M. H., Grinstein, B., Rey, S.-J., & Wise, M. B. 1986, ApJ, 311, 6


Kolb, E. W., & Turner, M. S. 1990, The Early Universe (Redwood City, CA: Addison-




Moutarde, F., Alimi, J.-M., Bouchet, F. R., Pellat, R., & Ramani, A. 1991, ApJ, 382, 377


C. D., & Wirtanen, C. A. 1967, Publ. Lick Obs., 22, Part 1




BIOGRAPHICAL  SKETCH 


a B.S.  in  physics  from  M.I.T.  in  Cambridge,  Massachu 






